Exp. Mail No. EM035613879US 
IDS Filed on March 5, 2008 
For US AN 10/714,389 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(19) World Intellectual Property Organization 
International Bureau

(43) International Publication Date 
1 August 2002 (01.08.2002) 




PCT 




(10) International Publication Number: WO 02/059377 A2



(51) International Patent Classification: C12Q 1/68



(21) International Application Number: PCT/US02/02242 



(22) International Filing Date: 24 January 2002 (24.01.2002) 



(25) Filing Language: English

(26) Publication Language: English



(30) Priority Data:
60/263,965    24 January 2001 (24.01.2001)    US
60/265,928    2 February 2001 (02.02.2001)    US
09/829,472    9 April 2001 (09.04.2001)       US
60/282,698    9 April 2001 (09.04.2001)       US
60/288,590    4 May 2001 (04.05.2001)         US
60/294,443    29 May 2001 (29.05.2001)        US



(71) Applicant: EOS BIOTECHNOLOGY, INC. [US/US]; 
225A Gateway Boulevard, South San Francisco, CA 
94080-7019 (US). 

(72) Inventors: MACK, David, H.; 2076 Monterey Avenue,
Menlo Park, CA 94025 (US). GISH, Kurt, C.; 40 Perego
Terrace #2, San Francisco, CA 94131 (US). AFAR, Daniel;
435 Visitacion Avenue, Brisbane, CA 94005 (US).



(74) Agents: BASTIAN, Kevin, L. et al.; Townsend and
Townsend and Crew LLP, Two Embarcadero Center, 8th
Floor, San Francisco, CA 94111-3834 (US).

(81) Designated States (national): AE, AG, AL, AM, AT, AU,
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU,
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH,
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC,
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW,
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG,
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, UZ, VN,
YU, ZA, ZM, ZW.

(84) Designated States (regional): ARIPO patent (GH, GM,
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW),
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM),
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR,
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR,
NE, SN, TD, TG).

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guidance
Notes on Codes and Abbreviations" appearing at the beginning
of each regular issue of the PCT Gazette.

(54) Title: METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS AND METHODS OF SCREENING FOR
MODULATORS OF BREAST CANCER

(57) Abstract: Described herein are genes whose expression is up-regulated or down-regulated in breast cancer. Related methods
and compositions that can be used for diagnosis and treatment of breast cancer are disclosed. Also described herein are methods that
can be used to identify modulators of breast cancer.






METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS
AND METHODS OF SCREENING FOR MODULATORS OF BREAST
CANCER

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/263,965, filed January 24, 2001; 
USSN 60/265,928, filed February 2, 2001; USSN 09/829,472, filed April 9, 2001; USSN
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN 
60/294,443, filed May 29, 2001, all of which are incorporated herein by reference in their 
entirety. 

FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
breast cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of breast cancer. The invention further relates to methods for
identifying and using agents and/or targets that inhibit breast cancer.

BACKGROUND OF THE INVENTION 
Breast cancer is one of the most frequently diagnosed cancers and the second
leading cause of female cancer death in North America and northern Europe, with lung
cancer being the leading cause. Lifetime incidence of the disease in the United States is
one-in-eight, with a 1-in-29 lifetime risk of dying from breast cancer. Early detection of breast
cancer, using mammography, clinical breast examination, and self breast examination, has
dramatically improved the treatment of the disease, although sensitivity is still a major concern,
as mammographic sensitivity has been estimated at only 60%-90%. Treatment of breast
cancer consists largely of surgical lumpectomy or mastectomy, radiation therapy,
anti-hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively
treated, the current therapies can all induce serious side effects which diminish quality of life.
Deciding on a particular course of treatment is typically based on a variety of prognostic
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978;
Hamilton and Piccart, 2000, Ann. Oncol. 11:647-663), including genetic predisposition
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. Oncol. 18:113sup-118sup).

Imaging of breast cancer for diagnosis has been problematic and limited. In
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an
important prognostic factor; five year survival rates drop from 80 percent in patients with no
lymph node metastases to 45 to 50 percent in those patients who do have lymph node
metastases. A recent report showed that micrometastases can be detected from lymph nodes
using reverse transcriptase-PCR methods based on the presence of mRNA for
carcinoembryonic antigen, which has previously been shown to be present in the vast
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med.
339(4):223 (1998).

The identification of novel therapeutic targets and diagnostic markers is
essential for improving the current treatment of breast cancer patients. Recent advances in
molecular medicine have increased the interest in tumor-specific cell surface antigens that
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens
suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell
antigen CD20. Humanized monoclonal antibodies directed to Her2/neu
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer
(Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal
antibodies (Rituxan®/rituximab) are used to effectively treat non-Hodgkin's lymphoma
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol.
10:548-551).

Other potential immunotherapeutic targets have been identified for breast
cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane
protein, present at the apical surface of glandular epithelial cells. It is often overexpressed in
breast cancer, and typically exhibits an altered glycosylation pattern, resulting in an
antigenically distinct molecule, and is in early clinical trials as a vaccine target (Gilewski et
al., 2000, Clin. Cancer Res. 6:1693-1701; Scholl et al., 2000, J. Immunother. 23:570-580).
The tumor-expressed protein is often cleaved into the circulation, where it is detectable as the
tumor marker CA 15-3 (Bon et al., 1997, Clin. Chem. 43:585-593). However, many patients
have tumors that express neither HER2 nor MUC-1; therefore, it is clear that other targets
need to be identified to manage localized and metastatic disease. Many other genes have been
reported to be overexpressed in breast cancer, such as EGFR (Sainsbury et al., 1987, Lancet
1(8547):1398-1402), c-erbB3 (Naidu et al., 1988, Br. J. Cancer 78:1385-1390), FGFR2
(Penault-Llorca et al., 1991, Int. J. Cancer 61:170-176), PKW (Preiherr et al., 2000,
Anticancer Res. 20:2255-2264), MTA1 (Nawa et al., 2000, J. Cell Biochem. 79:202-212), and
breast cancer associated gene 1 (Kurt et al., 2000, Breast Cancer Res. Treat. 59:41-48).
Although monoclonal antibodies to the protein products of some of these overexpressed
genes have been reported (for review, see Green et al., 2000, Cancer Treat. Rev. 26:269-286),
none are currently approved for breast cancer therapy in the US.

Disclosures of certain genes and ESTs described as being expressed in breast
cancer are found in international patent applications WO-99/33869, WO-97/25426,
WO-97/02280, WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs
described as being expressed in breast cancer are disclosed in US Patent Nos. 5,759,776 and
5,693,522. The utility of such genes is described in each of these publications, and their
disclosures are incorporated herein in their entirety.

While industry and academia have identified novel sequences, there has not
been an equal effort exerted to identify the function of these novel sequences. The
elucidation of a role for novel proteins and compounds in disease states for identification of
therapeutic targets and diagnostic markers is essential for improving the current treatment of
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic
intervention in breast and other cancers. Additionally, provided herein are methods that can
be used in diagnosis and prognosis of breast cancer. Further provided are methods that can
be used to screen candidate bioactive agents for the ability to modulate breast cancer.

SUMMARY OF THE INVENTION
The present invention therefore provides nucleotide sequences of genes that 
are up- and down-regulated in breast cancer cells. Such genes are useful for diagnostic 
purposes, and also as targets for screening for therapeutic compounds that modulate breast
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
breast cancer-associated transcript in a cell from a patient, the method comprising contacting
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the biological sample is a tissue sample. In another
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA.

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent
label.

In one embodiment, the polynucleotide is immobilized on a solid surface.

In one embodiment, the patient is undergoing a therapeutic regimen to treat
breast cancer. In another embodiment, the patient is suspected of having metastatic breast
cancer.

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA.

In one embodiment, the method further comprises the step of amplifying
nucleic acids before the step of contacting the biological sample with the polynucleotide.

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of breast cancer, the method comprising the steps of: (i) 
providing a biological sample from a patient undergoing the therapeutic treatment; and (ii)
determining the level of a breast cancer-associated transcript in the biological sample by 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25, thereby monitoring 
the efficacy of the therapy. In a further embodiment, the patient has metastatic breast cancer. 
In a further embodiment, the patient has a drug resistant form of breast cancer.

In one embodiment, the method further comprises the step of: (iii) comparing
the level of the breast cancer-associated transcript to a level of the breast cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

Additionally, provided herein is a method of evaluating the effect of a
candidate breast cancer drug comprising administering the drug to a patient and removing a
cell sample from the patient. The expression profile of the cell is then determined. This
method may further comprise comparing the expression profile to an expression profile of a
healthy individual. In a preferred embodiment, said expression profile includes a gene of
Tables 1-25.

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic
acid.

In one aspect, the present invention provides an isolated polypeptide which is
encoded by a nucleic acid molecule having a polynucleotide sequence as shown in Tables 1-25.

In another aspect, the present invention provides an antibody that specifically
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having a
polynucleotide sequence as shown in Tables 1-25.

In one embodiment, the antibody is conjugated to an effector component, e.g.,
a fluorescent label, a radioisotope or a cytotoxic chemical.

In one embodiment, the antibody is an antibody fragment. In another
embodiment, the antibody is humanized.

In one aspect, the present invention provides a method of detecting a breast
cancer cell in a biological sample from a patient, the method comprising contacting the
biological sample with an antibody as described herein.

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic
effect, or a chemical effect.

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or
cell membrane. In another embodiment, the polypeptide is recombinant.

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody.

In another aspect, the present invention provides a drug screening assay
comprising the steps of: (i) administering a test compound to a mammal having breast cancer
or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence
as shown in Tables 1-25 in a treated cell or mammal with the level of gene expression of the
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates
the level of expression of the polynucleotide is a candidate for the treatment of breast cancer.

In one embodiment, the control is a mammal with breast cancer or a cell 
sample therefrom that has not been treated with the test compound. In another embodiment, 
the control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time
periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
individually compared to their respective levels in a control cell sample or mammal. In a
preferred embodiment, the plurality of polynucleotides is from three to ten.
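
The comparison of treated and control expression levels described above can be sketched in code. The following is only an illustration under stated assumptions: the function name, the probe names, and the log2-ratio cutoff of 1.0 (a two-fold change) are hypothetical and do not come from this specification, which does not prescribe a particular threshold or statistic.

```python
import math
from typing import Dict


def modulated_transcripts(
    treated: Dict[str, float],
    control: Dict[str, float],
    min_abs_log2_ratio: float = 1.0,  # assumed cutoff: two-fold change
) -> Dict[str, float]:
    """Return log2(treated / control) for transcripts whose level changed.

    Each transcript is compared individually to its own control level.
    Transcripts missing from either sample, or with a non-positive level,
    are skipped because the ratio is undefined for them.
    """
    changed = {}
    for name in treated.keys() & control.keys():
        t, c = treated[name], control[name]
        if t <= 0 or c <= 0:
            continue
        log_ratio = math.log2(t / c)
        if abs(log_ratio) >= min_abs_log2_ratio:
            changed[name] = log_ratio
    return changed


# Hypothetical expression levels for three probes (arbitrary units).
treated_levels = {"probe_A": 400.0, "probe_B": 100.0, "probe_C": 25.0}
control_levels = {"probe_A": 100.0, "probe_B": 110.0, "probe_C": 100.0}
print(modulated_transcripts(treated_levels, control_levels))
```

A positive value indicates a higher level in the treated sample and a negative value a lower level; under this sketch, a test compound producing either kind of change would be flagged for follow-up.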

In another aspect, the present invention provides a method for treating a 
mammal having breast cancer comprising administering a compound identified by the assay 
described herein. 

In another aspect, the present invention provides a pharmaceutical
composition for treating a mammal having breast cancer, the composition comprising a
compound identified by the assay described herein and a physiologically acceptable
excipient.

In one aspect, the present invention provides a method of screening drug 
candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a drug candidate to the cell and determining the effect of the drug candidate 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes
comparing the level of expression in the absence of the drug candidate to the level of
expression in the presence of the drug candidate, wherein the concentration of the drug
candidate can vary when present, and wherein the comparison can occur after addition or
removal of the drug candidate. In a preferred embodiment, the cell expresses at least two
expression profile genes. The profile genes may show an increase or decrease.

Also provided is a method of evaluating the effect of a candidate breast cancer
drug comprising administering the drug to a transgenic animal expressing or over-expressing
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory
protein, for example as a result of a gene knockout.

Moreover, provided herein is a biochip comprising one or more nucleic acid
segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes.
Preferably, at least two nucleic acid segments are included. More preferably, at least three
nucleic acid segments are included.

Furthermore, a method of diagnosing a disorder associated with breast cancer
is provided. The method comprises determining the expression of a gene of Tables 1-25,
preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the
distribution to the expression of the gene from a second normal tissue type from the first
individual or a second unaffected individual. A difference in the expression indicates that the
first individual has a disorder associated with breast cancer.

In a further embodiment, the biochip also includes a polynucleotide sequence
of a gene that is not up- and down-regulated in breast cancer.

In one embodiment, a method is provided for screening for a bioactive agent capable of
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory
protein or fragment thereof. In a preferred embodiment, the method comprises combining a
breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an
antibody which binds to said breast cancer modulatory protein or fragment thereof. The
method further includes determining the binding of said breast cancer modulatory protein or
fragment thereof and said antibody. Where there is a change in binding, an agent is
identified as an interfering agent. The interfering agent can be an agonist or an antagonist.
Preferably, the agent inhibits breast cancer.

Also provided herein are methods of eliciting an immune response in an
individual. In one embodiment, a method provided herein comprises administering to an
individual a composition comprising a breast cancer modulating protein, or a fragment
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those
of Tables 1-25.

Further provided herein are compositions capable of eliciting an immune
response in an individual. In one embodiment, a composition provided herein comprises a
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more
preferably of Table 25, or a fragment thereof, and a pharmaceutically acceptable carrier. In
another embodiment, said composition comprises a nucleic acid comprising a sequence
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of
Tables 1-25, and a pharmaceutically acceptable carrier.

Also provided are methods of neutralizing the effect of a breast cancer protein, 
or a fragment thereof, comprising contacting an agent specific for said protein with said 
protein in an amount sufficient to effect neutralization. In another embodiment, the protein is
encoded by a nucleic acid selected from those of Tables 1-25. 

In another aspect of the invention, a method of treating an individual for breast
cancer is provided. In one embodiment, the method comprises administering to said
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the
method comprises administering to a patient having breast cancer an antibody to a breast
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can
be a cytotoxic agent or a radioisotope.

DETAILED DESCRIPTION OF THE INVENTION
In accordance with the objects outlined above, the present invention provides
novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including
metastatic breast cancer, as well as methods for screening for compositions which modulate
breast cancer. Also provided are methods for treating breast cancer.

Tables 1-24B provide unigene cluster identification numbers for the
nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer
samples. Tables 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 18, 19, 20, 21, and 22 list those genes that are
up-regulated in breast cancer cells. Table 14 lists those genes that are highly up-regulated in
breast cancer cells. Tables 1, 2, 3, 15, and 23 list genes that are down-regulated in breast
cancer cells, and Table 16 lists genes that are highly down-regulated in breast cancer cells.
The Tables also provide an exemplar accession number that provides a nucleotide sequence
that is part of the unigene cluster.

Definitions 

The term "breast cancer protein" or "breast cancer polynucleotide" or "breast
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants,
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%,
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide
sequence identity, preferably over a region of at least about 25, 50, 100, 200,
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of
Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybridize
under stringent hybridization conditions to a nucleic acid sequence, or the complement
thereof, of Tables 1-25 and conservatively modified variants thereof; or (4) have an amino acid
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%,
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater
amino acid sequence identity, preferably over a region of at least about 25, 50,
100, 200, 500, 1000, or more amino acids, to an amino acid sequence encoded by a nucleotide
sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide
sequence is typically from a mammal including, but not limited to, primate, e.g., human;
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer
polypeptide" and a "breast cancer polynucleotide" include both naturally occurring and
recombinant forms.

A "full length" breast cancer protein or nucleic acid refers to a breast cancer
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements
normally contained in one or more naturally occurring, wild type breast cancer
polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various
stages of post-translation processing or splicing, including alternative splicing.

"Biological sample" as used herein is a sample of biological tissue or fluid that
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, polynucleotide or
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g.,
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes,
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also
include explants and primary and/or transformed cell cultures derived from patient tissues. A
biological sample is typically obtained from a eukaryotic organism, most preferably a
mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea
pig, rat, mouse; rabbit; or a bird; reptile; or fish.

"Providing a biological sample" means to obtain a biological sample for use in
methods described in this invention. Most often, this will be done by removing a sample of
cells from an animal, but can also be accomplished by using previously isolated cells (e.g.,
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will
be particularly useful.

The terms "identical" or percent "identity," in the context of two or more
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that
are the same or have a specified percentage of amino acid residues or nucleotides that are the
same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%,
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and
aligned for maximum correspondence over a comparison window or designated region) as
measured using the BLAST or BLAST 2.0 sequence comparison algorithms with default
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to
be "substantially identical." This definition also refers to, or may be applied to, the
complement of a test sequence. The definition also includes sequences that have deletions
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g.,
polymorphic or allelic variants, and man-made variants. As described below, the preferred
algorithms can account for gaps and the like. Preferably, identity exists over a region that is
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is
50-100 amino acids or nucleotides in length.
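
As a minimal sketch of the percent identity calculation defined above, the following assumes two sequences that have already been aligned, with gaps written as "-". The function name and the choice to count gapped columns in the denominator are assumptions made for illustration; alignment programs such as BLAST may define the compared region differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Both inputs must be rows of the same alignment and therefore equal in length.
    Columns where both rows are gaps are ignored; a column with a gap in only
    one row counts as a non-identical position.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# One substitution in eight aligned positions.
print(percent_identity("ACGTACGT", "ACGTTCGT"))  # 87.5
# One single-residue insertion relative to the first sequence.
print(percent_identity("ACG-T", "ACGAT"))  # 80.0
```

Under this sketch, a pair would meet an "at least 80% identical" criterion when the returned value is 80.0 or higher over the designated region.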

For sequence comparison, typically one sequence acts as a reference sequence,
to which test sequences are compared. When using a sequence comparison algorithm, test
and reference sequences are entered into a computer, subsequence coordinates are designated,
if necessary, and sequence algorithm program parameters are designated. Preferably, default
program parameters can be used, or alternative parameters can be designated. The sequence
comparison algorithm then calculates the percent sequence identities for the test sequences
relative to the reference sequence, based on the program parameters.

A "comparison window", as used herein, includes reference to a segment of
one of the number of contiguous positions selected from the group consisting typically of
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which
a sequence may be compared to a reference sequence of the same number of contiguous
positions after the two sequences are optimally aligned. Methods of alignment of sequences
for comparison are well-known in the art. Optimal alignment of sequences for comparison
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl.
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol.
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l.
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package,
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and
visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al., eds.
1995 supplement)).
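
To illustrate the kind of local alignment the Smith & Waterman algorithm performs, the following sketch computes only the best local alignment score by dynamic programming. It is a simplified illustration, not the GAP or BESTFIT implementation: the scoring values (match +2, mismatch -1, linear gap -2) and the function name are assumptions chosen for the example.

```python
def smith_waterman_score(
    a: str,
    b: str,
    match: int = 2,      # assumed reward for identical residues
    mismatch: int = -1,  # assumed penalty for differing residues
    gap: int = -2,       # assumed linear penalty per gap position
) -> int:
    """Return the highest-scoring local alignment score between a and b."""
    previous = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = previous[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            current[j] = max(0, diagonal, previous[j] + gap, current[j - 1] + gap)
            best = max(best, current[j])
        previous = current
    return best


# The shared segment "ACGT" gives four matches at +2 each.
print(smith_waterman_score("TTACGTTT", "GGACGTGG"))  # 8
```

Only two rows of the scoring matrix are kept because the sketch reports the score alone; recovering the aligned residues would require storing the full matrix and tracing back from the best cell.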

Preferred examples of algorithms that are suitable for determining percent
sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms,
which are described in Altschul et al., Nuc. Acids Res. 25:3389-3402 (1997) and Altschul et
al., J. Mol. Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters
described herein, to determine percent sequence identity for the nucleic acids and proteins of
the invention. Software for performing BLAST analyses is publicly available through the
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short
words of length W in the query sequence, which either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence. T is
referred to as the neighborhood word score threshold (Altschul et al., supra). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing
them. The word hits are extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g.,
for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word
hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences)
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a
comparison of both strands. For amino acid sequences, the BLASTP program uses as
defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1992)) alignments (B) of
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
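
The seed-and-extend procedure described above can be sketched for nucleotide sequences as
follows. This is a minimal illustration under stated assumptions, not the NCBI implementation:
the function names find_seeds and extend_right and the value chosen for X are hypothetical,
seeds are exact word matches only (the threshold T is not modeled), and only ungapped
rightward extension is shown. M=5, N=-4 and W=11 are the BLASTN values given above.

```python
def find_seeds(query, subject, w=11):
    """Return (query_pos, subject_pos) pairs where a word of length w matches exactly."""
    index = {}
    for j in range(len(subject) - w + 1):
        index.setdefault(subject[j:j + w], []).append(j)
    seeds = []
    for i in range(len(query) - w + 1):
        for j in index.get(query[i:i + w], []):
            seeds.append((i, j))
    return seeds


def extend_right(query, subject, qi, sj, m=5, n=-4, x=20):
    """Ungapped extension to the right, starting at query[qi] and subject[sj].

    Stops when the running score falls x below its maximum, when it goes to
    zero or below, or when either sequence ends.
    Returns (best_score, length_at_best_score).
    """
    score = best = 0
    length = best_len = 0
    while qi + length < len(query) and sj + length < len(subject):
        score += m if query[qi + length] == subject[sj + length] else n
        length += 1
        if score > best:
            best, best_len = score, length
        if best - score >= x or score <= 0:
            break
    return best, best_len
```

A full search would extend each seed in both directions and keep the highest scoring
segment; the leftward extension mirrors the loop above.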

The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA
90:5873-5877 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest
sum probability (P(N)), which provides an indication of the probability by which a match
between two nucleotide or amino acid sequences would occur by chance. For example, a
nucleic acid is considered similar to a reference sequence if the smallest sum probability in a
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more
preferably less than about 0.01, and most preferably less than about 0.001. Log values may
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.

An indication that two nucleic acid sequences or polypeptides are substantially
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross
reactive with the antibodies raised against the polypeptide encoded by the second nucleic
acid, as described below. Thus, a polypeptide is typically substantially identical to a second
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another
indication that two nucleic acid sequences are substantially identical is that the two molecules
or their complements hybridize to each other under stringent conditions, as described below.
Yet another indication that two nucleic acid sequences are substantially identical is that the
same primers can be used to amplify the sequences.

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is
substantially or essentially free from components that normally accompany it as found in its
native state. Purity and homogeneity are typically determined using analytical chemistry
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than protein
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means
removing at least one contaminant from the composition to be purified. In this sense,
purification does not require that the purified compound be homogenous, e.g., 100% pure.

The terms "polypeptide," "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those
containing modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids,
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to
chemical compounds that have a structure that is different from the general chemical
structure of an amino acid, but that functions similarly to a naturally occurring amino acid.

Amino acids may be referred to herein by either their commonly known three
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly
accepted single-letter codes.

"Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refers to those nucleic acids which encode identical or essentially identical amino
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino
acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can
be altered to another of the corresponding codons described without altering the encoded
polypeptide. Such nucleic acid variations are "silent variations," which are one species of
conservatively modified variations. Every nucleic acid sequence herein which encodes a
polypeptide also describes silent variations of the nucleic acid. One of skill will recognize
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can
be modified to yield a functionally identical molecule. Accordingly, often silent variations of
a nucleic acid which encodes a polypeptide are implicit in a described sequence with respect to
the expression product, but not with respect to actual probe sequences.
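
The alanine example above can be made concrete with a short sketch. It is illustrative
only: the codon table is deliberately partial (just the codons named in this paragraph,
with the tryptophan codon written UGG in the RNA alphabet), and the helper name
silent_variants is hypothetical.

```python
from itertools import product

# Partial codon table (RNA alphabet): only the codons named in the text above.
CODONS = {
    "GCA": "Ala", "GCC": "Ala", "GCG": "Ala", "GCU": "Ala",
    "AUG": "Met",
    "UGG": "Trp",
}


def silent_variants(rna):
    """Return every sequence that encodes the same residues as rna.

    rna must be a whole number of codons, all present in CODONS.
    """
    codons = [rna[i:i + 3] for i in range(0, len(rna), 3)]
    choices = []
    for codon in codons:
        residue = CODONS[codon]
        # All codons in the table that encode the same residue, in sorted order.
        choices.append(sorted(c for c, aa in CODONS.items() if aa == residue))
    return ["".join(combo) for combo in product(*choices)]
```

For the input AUGGCAUGG only the alanine codon can vary, so four sequences are returned;
the methionine and tryptophan codons each have a single choice.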

As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein
sequence which alters, adds or deletes a single amino acid or a small percentage of amino
acids in the encoded sequence is a "conservatively modified variant" where the alteration
results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well known in
the art. Such conservatively modified variants are in addition to and do not exclude
polymorphic variants, interspecies homologs, and alleles of the invention. The following
groups each contain amino acids that are typically conservative substitutions for one
another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E);
3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I),
Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y),
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see,
e.g., Creighton, Proteins (1984)).
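
The eight groups listed above lend themselves to a simple lookup. The sketch below is an
illustration only; the function name is_conservative is hypothetical, and treating a
residue paired with itself as "not a substitution" is an assumption of this example.

```python
# The eight groups from the text, as sets of one-letter codes.
GROUPS = [
    set("AG"), set("DE"), set("NQ"), set("RK"),
    set("ILMV"), set("FYW"), set("ST"), set("CM"),
]


def is_conservative(a, b):
    """True if a and b are different residues that share at least one group."""
    return a != b and any(a in group and b in group for group in GROUPS)
```

Methionine appears in both group 5 and group 8, so it counts as a conservative
substitution for leucine as well as for cysteine.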

Macromolecular structures such as polypeptide structures can be described in
terms of various levels of organization. For a general discussion of this organization, see,
e.g., Alberts et al., Molecular Biology of the Cell (3rd ed., 1994) and Cantor & Schimmel,
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980).
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary
structure" refers to locally ordered, three dimensional structures within a polypeptide. These
structures are commonly known as domains. Domains are portions of a polypeptide that
often form a compact unit of the polypeptide and are typically 25 to approximately 500
amino acids long. Typical domains are made up of sections of lesser organization such as
stretches of β-sheet and α-helices. "Tertiary structure" refers to the complete three
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three
dimensional structure formed, usually by the noncovalent association of independent tertiary
units. Anisotropic terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical
equivalents used herein means at least two nucleotides covalently linked together.
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and
polynucleotides are polymers of any length, including longer lengths, e.g., 200, 300, 500,
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are
included that may have alternate backbones, comprising, e.g., phosphoramidate,
phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages (see Eckstein,
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with
positive backbones; non-ionic backbones, and non-ribose backbones, including those
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ACS
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghvi &
Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done
for a variety of reasons, e.g., to increase the stability and half-life of such molecules in
physiological environments or as probes on a biochip. Mixtures of naturally occurring
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.

A variety of references disclose such nucleic acid analogs, including, for
example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 (1993) and references
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579
(1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al., Chem. Lett. 805
(1984), Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels et al., Chemica
Scripta 26:141 (1986)), phosphorothioate (Mag et al., Nucleic Acids Res. 19:1437 (1991);
and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am. Chem. Soc. 111:2321
(1989)), O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues:
A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and
linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl.
31:1008 (1992); Nielsen, Nature 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all
of which are incorporated by reference). Other analog nucleic acids include those with
positive backbones (Dempcy et al., Proc. Natl. Acad. Sci. USA 92:6097 (1995)); non-ionic
backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863;
Kiedrowski et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am.
Chem. Soc. 110:4470 (1988); Letsinger et al., Nucleoside & Nucleotide 13:1597 (1994);
Chapters 2 and 3, ACS Symposium Series 580, "Carbohydrate Modifications in Antisense
Research", Ed. Y.S. Sanghvi and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal
Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett.
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos.
5,235,033 and 5,034,506, and Chapters 6 and 7, ACS Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghvi and P. Dan Cook. Nucleic acids
containing one or more carbocyclic sugars are also included within one definition of nucleic
acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs
are described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby
expressly incorporated by reference.

Particularly preferred are peptide nucleic acids (PNA) which include peptide
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly,
due to their non-ionic nature, hybridization of the bases attached to these backbones is
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular
enzymes, and thus can be more stable.

The nucleic acids may be single stranded or double stranded, as specified, or
contain portions of both double stranded or single stranded sequence. As will be appreciated
by those in the art, the depiction of a single strand also defines the sequence of the
complementary strand; thus the sequences described herein also provide the complement of
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid,
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine,
xanthine, hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a
naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes
non-naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside.
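
The statement above that depicting one strand also defines its complement can be shown
with a minimal sketch. It assumes plain DNA written with A, C, G and T only; the other
bases named in this paragraph are not handled, and the function name reverse_complement
is chosen for illustration.

```python
# Watson-Crick partners for DNA; other bases named in the text are not handled here.
PAIR = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    return "".join(PAIR[base] for base in reversed(dna.upper()))
```

Applying the function twice returns the original sequence, which is the sense in which
either strand determines the other.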

A "label" or a "detectable moiety" is a composition detectable by
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical
means. For example, useful labels include ³²P, fluorescent dyes, electron-dense reagents,
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins
or other entities which can be made detectable, e.g., by incorporating a radiolabel into the
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position.
Any method known in the art for conjugating the antibody to the label may be employed,
including those methods described by Hunter et al., Nature 144:945 (1962); David et al.,
Biochemistry 13:1014 (1974); Pain et al., J. Immunol. Meth. 40:219 (1981); and Nygren, J.
Histochem. and Cytochem. 30:407 (1982).

An "effector" or "effector moiety" or "effector component" is a molecule that
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody.
The "effector" can be a variety of molecules including, e.g., detection moieties including
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a
radioisotope emitting "hard" e.g., beta radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin, streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is defined as a
nucleic acid capable of binding to a target nucleic acid of complementary sequence through
one or more types of chemical bonds, usually through complementary base pairing, usually
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C,
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe
may be joined by a linkage other than a phosphodiester bond, so long as it does not
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.
It will be understood by one of skill in the art that probes may bind target sequences lacking
complete complementarity with the probe sequence depending upon the stringency of the
hybridization conditions. The probes are preferably directly labeled as with isotopes,
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a
streptavidin complex may later bind. By assaying for the presence or absence of the probe,
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or
prognosis may be based at the genomic level, or at the level of RNA or protein expression.

The term "recombinant" when used with reference, e.g., to a cell, or nucleic
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g.,
recombinant cells express genes that are not found within the native (non-recombinant) form
of the cell or express native genes that are otherwise abnormally expressed, under expressed
or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid,
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are both considered recombinant for the purposes of this invention. It is
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein
made using recombinant techniques, i.e., through the expression of a recombinant nucleic
acid as depicted above.

The term "heterologous" when used with reference to portions of a nucleic
acid indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

A "promoter" is defined as an array of nucleic acid control sequences that
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic
acid sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor
elements, which can be located as much as several thousand base pairs from the start site of
transcription. A "constitutive" promoter is a promoter that is active under most
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence,
wherein the expression control sequence directs transcription of the nucleic acid
corresponding to the second sequence.

An "expression vector" is a nucleic acid construct, generated recombinantly or
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be
transcribed operably linked to a promoter.

The phrase "selectively (or specifically) hybridizes to" refers to the binding,
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under
stringent hybridization conditions when that sequence is present in a complex mixture (e.g.,
total cellular or library DNA or RNA).

The phrase "stringent hybridization conditions" refers to conditions under
which a probe will hybridize to its target subsequence, typically in a complex mixture of
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays"
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The
Tm is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at
which 50% of the probes complementary to the target hybridize to the target sequence at
equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are
occupied at equilibrium). Stringent conditions will be those in which the salt concentration
is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration
(or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes
(e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing
agents such as formamide. For selective or specific hybridization, a positive signal is at
least two times background, preferably 10 times background hybridization. Exemplary stringent
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS,
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency
amplification, although annealing temperatures may vary between about 32°C and 48°C
depending on primer length. For high stringency PCR amplification, a temperature of about
62°C is typical, although high stringency annealing temperatures can range from about 50°C
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification
reactions are provided, e.g., in Innis et al. (1990) PCR Protocols, A Guide to Methods and
Applications, Academic Press, Inc., N.Y.
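
The rule of thumb above, that stringent conditions sit about 5-10°C below the Tm, can be
illustrated with a rough calculation. The text does not say how Tm is to be estimated, so
this sketch swaps in the Wallace rule (2°C per A or T, 4°C per G or C), which is only a
coarse approximation for short oligonucleotides and is an assumption of this example, as
are the function names wallace_tm and stringent_range.

```python
def wallace_tm(oligo):
    """Rough Tm in degrees C for a short DNA oligo: 2 per A/T plus 4 per G/C.

    The Wallace rule is an assumption of this sketch, not a method from the text.
    """
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


def stringent_range(tm):
    """Temperatures about 5-10 degrees C below Tm, returned as (low, high)."""
    return tm - 10, tm - 5
```

In practice the Tm also depends on ionic strength, pH and formamide, which this estimate
ignores; the references cited above give fuller treatments.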

Nucleic acids that do not hybridize to each other under stringent conditions are
still substantially identical if the polypeptides which they encode are substantially identical.
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize
under moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37°C, and a wash in 1X SSC at 45°C. A positive hybridization is at least twice
background. Those of ordinary skill will readily recognize that alternative hybridization and
wash conditions can be utilized to provide conditions of similar stringency. Additional
guidelines for determining hybridization parameters are provided in numerous references, e.g.,
Current Protocols in Molecular Biology, ed. Ausubel, et al.

The phrase "functional effects" in the context of assays for testing compounds
that modulate activity of a breast cancer protein includes the determination of a parameter
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid,
e.g., a functional, physical, or chemical effect, such as the ability to decrease breast cancer. It
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in
vitro, in vivo, and ex vivo activities.

By "determining the functional effect" is meant assaying for a compound that
increases or decreases a parameter that is indirectly or directly under the influence of a breast
cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. Such
functional effects can be measured by any means known to those skilled in the art, e.g.,
changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index),
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein,
measuring inducible markers or transcriptional activation of the breast cancer protein;
measuring binding activity or binding assays, e.g., binding to antibodies or other ligands, and
measuring cellular proliferation. Determination of the functional effect of a compound on
breast cancer can also be performed using breast cancer assays known to those of skill in the
art such as in vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of breast cancer cells. The functional effects can be
evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative
or qualitative measures of alterations in morphological features, measurement of changes in
RNA or protein levels for breast cancer-associated sequences, measurement of RNA stability,
identification of downstream or reporter gene expression (CAT, luciferase, β-gal, GFP and
the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding,
inducible markers, and ligand binding assays.

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compounds identified using in vitro and in vivo assays of breast cancer polynucleotide and
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the
activity or expression of breast cancer proteins, e.g., antagonists. Antisense nucleic acids
may seem to inhibit expression and subsequent function of the protein. "Activators" are
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or
up regulate breast cancer protein activity. Inhibitors, activators, or modulators also include
genetically modified versions of breast cancer proteins, e.g., versions with altered activity, as 
well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 
chemical molecules and the like. Such assays for inhibitors and activators include, e.g.,
expressing the breast cancer protein in vitro, in cells, or cell membranes, applying putative
modulator compounds, and then determining the functional effects on activity, as described
above. Activators and inhibitors of breast cancer can also be identified by incubating breast
cancer cells with the test compound and determining increases or decreases in the expression
of 1 or more breast cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more breast
cancer proteins, such as breast cancer proteins encoded by the sequences set out in Tables 1-25.

Samples or assays comprising breast cancer proteins that are treated with a
potential activator, inhibitor, or modulator are compared to control samples without the
inhibitor, activator, or modulator to examine the extent of inhibition or activation. Control
samples (untreated with inhibitors) are assigned a relative protein activity value of 100%.
Inhibition of a polypeptide is achieved when the activity value relative to the control is about
80%, preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is
achieved when the activity value relative to the control (untreated with activators) is 110%,
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the
control), more preferably 1000-3000% higher.
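
The comparison to control described above can be sketched in a few lines of Python. This is only an illustration: the function name, the treatment of 80% and 110% as hard cutoffs, and the "no effect" label for values in between are assumptions made for the example, not part of the disclosure.

```python
from __future__ import annotations


def classify_activity(treated: float, control: float,
                      inhibition_cutoff: float = 80.0,
                      activation_cutoff: float = 110.0) -> tuple[float, str]:
    """Express treated activity as a percentage of control (control = 100%).

    The cutoffs default to the broadest values named in the text
    (about 80% for inhibition, 110% for activation); treating them as
    sharp boundaries is an assumption of this sketch.
    """
    if control <= 0:
        raise ValueError("control activity must be positive")
    percent = 100.0 * treated / control
    if percent <= inhibition_cutoff:
        label = "inhibition"
    elif percent >= activation_cutoff:
        label = "activation"
    else:
        label = "no effect"
    return percent, label


# Hypothetical readings in arbitrary activity units.
print(classify_activity(40.0, 100.0))
print(classify_activity(220.0, 100.0))
```

Stricter cutoffs (e.g., 50% or 150%) can be passed as arguments to mirror the preferred ranges.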

The phrase "changes in cell growth" refers to any change in cell growth and
proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage
independence, semi-solid or soft agar growth, changes in contact inhibition and density
limitation of growth, loss of growth factor or serum requirements, changes in cell
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or
immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells: A Manual of Basic
Technique pp. 231-241 (3rd ed. 1994).

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor.

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers
to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of
new genetic material. Although transformation can arise from infection with a transforming
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney,
Culture of Animal Cells: A Manual of Basic Technique (3rd ed. 1994)).

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG,
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or
its functional equivalent will be most critical in specificity and affinity of binding. See Paul,
Fundamental Immunology.

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus
of each chain defines a variable region of about 100 to 110 or more amino acids primarily
responsible for antigen recognition. The terms variable light chain (VL) and variable heavy
chain (VH) refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a
dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2
may be reduced under mild conditions to break the disulfide linkage in the hinge region,
thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed.
1993)). While various antibody fragments are defined in terms of the digestion of an intact
antibody, one of skill will appreciate that such fragments may be synthesized de novo either
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used
herein, also includes antibody fragments either produced by the modification of whole
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et al., Nature
348:552-554 (1990)).

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler & Milstein,
Nature 256:495-497 (1975); Kozbor et al., Immunology Today 4:72 (1983); Cole et al., pp.
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in
Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding,
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce
antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such
as other mammals, may be used to express humanized antibodies. Alternatively, phage
display technology can be used to identify antibodies and heteromeric Fab fragments that
specifically bind to selected antigens (see, e.g., McCafferty et al., Nature 348:552-554
(1990); Marks et al., Biotechnology 10:779-783 (1992)).

A "chimeric antibody" is an antibody molecule in which (a) the constant
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site
(variable region) is linked to a constant region of a different or altered class, effector function
and/or species, or an entirely different molecule which confers new properties to the chimeric
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a
different or altered antigen specificity.

Identification of breast cancer-associated sequences

In one aspect, the expression levels of genes are determined in different
patient samples for which diagnosis information is desired, to provide expression profiles.
An expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have any particular gene similarly expressed, the evaluation of
a number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast or other
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the breast, or
breast cancer tissue or metastatic breast cancerous tissue can be compared with tissue
samples of breast and other tissues from surviving cancer patients. By comparing expression
profiles of tissue in known different breast cancer states, information regarding which genes
are important (including both up- and down-regulation of genes) in each of these states is
obtained.

The identification of sequences that are differentially expressed in breast
cancer versus non-breast cancer tissue allows the use of this information in a number of ways.
For example, a particular treatment regime may be evaluated: does a chemotherapeutic drug
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular
patient? Similarly, diagnosis and treatment outcomes may be done or confirmed by
comparing patient samples with the known expression profiles. Metastatic tissue can also be
analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene
expression profiles (or individual genes) allow screening of drug candidates with an eye to
mimicking or altering a particular expression profile; e.g., screening can be done for drugs
that suppress the breast cancer expression profile. This may be done by making biochips
comprising sets of the important breast cancer genes, which can then be used in these screens.
These methods can also be done on the protein basis; that is, protein expression levels of the
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents.
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins
(including antibodies and other modulators thereof) administered as therapeutic drugs.



Thus the present invention provides nucleic acid and protein sequences that
are differentially expressed in breast cancer, herein termed "breast cancer sequences." As
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at
a higher level) in breast cancer, as well as those that are down-regulated (i.e., expressed at a
lower level). In a preferred embodiment, the breast cancer sequences are from humans;
however, as will be appreciated by those in the art, breast cancer sequences from other
organisms may be useful in animal models of disease and drug evaluation; thus, other breast
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats,
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows,
horses, etc.) and pets (e.g., dogs, cats, etc.). Breast cancer sequences from other organisms
may be obtained using the techniques outlined below.

Breast cancer sequences can include both nucleic acid and amino acid
sequences. As will be appreciated by those in the art and is more fully outlined below, breast
cancer nucleic acid sequences are useful in a variety of applications, including diagnostic
applications, which will detect naturally occurring nucleic acids, as well as screening
applications; e.g., biochips comprising nucleic acid probes or PCR microtiter plates with
selected probes to the breast cancer sequences can be generated.

A breast cancer sequence can be initially identified by substantial nucleic acid
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such
homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, using either homology programs or hybridization
conditions.

For identifying breast cancer-associated sequences, the breast cancer screen
typically includes comparing genes identified in different tissues, e.g., normal and cancerous
tissues, or tumor tissue samples from patients who have metastatic disease vs. non metastatic
tissue. Other suitable tissue comparisons include comparing breast cancer samples with
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers,
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant
states, and tissue undergoing metastasis, are applied to biochips comprising nucleic acid
probes. The samples are first microdissected, if applicable, and treated as is known in the art
for the preparation of mRNA. Suitable biochips are commercially available, e.g. from
Affymetrix. Gene expression profiles as described herein are generated and the data
analyzed.

In one embodiment, the genes showing changes in expression as between
normal and disease states are compared to genes expressed in other normal tissues, preferably
normal breast, but also including, and not limited to lung, heart, brain, liver, breast, kidney,
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred
embodiment, those genes identified during the breast cancer screen that are expressed in any
significant amount in other tissues are removed from the profile, although in some
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable
that the target be disease specific, to minimize possible side effects.
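
The filtering step described above, in which candidates expressed in other normal tissues are dropped from the profile, can be illustrated with a short Python sketch. The gene names, expression values, and the numeric cutoff standing in for "any significant amount" are hypothetical; the text does not specify a threshold.

```python
def remove_broadly_expressed(candidates, normal_tissue_levels, cutoff):
    """Keep candidates whose expression stays below `cutoff` in every
    other normal tissue for which a measurement is available.

    `cutoff` is a hypothetical stand-in for "any significant amount".
    """
    kept = []
    for gene in candidates:
        levels = normal_tissue_levels.get(gene, {})
        if all(level < cutoff for level in levels.values()):
            kept.append(gene)
    return kept


# Hypothetical expression values (arbitrary units) in normal tissues.
normal_tissue_levels = {
    "GENE_A": {"lung": 5.0, "liver": 8.0, "kidney": 3.0},
    "GENE_B": {"lung": 300.0, "liver": 4.0, "kidney": 2.0},
}
print(remove_broadly_expressed(["GENE_A", "GENE_B"], normal_tissue_levels, cutoff=50.0))
```

Here GENE_B would be removed because of its lung expression, reflecting the preference for disease-specific targets.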

In a preferred embodiment, breast cancer sequences are those that are
up-regulated in breast cancer; that is, the expression of these genes is higher in the breast cancer
tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often means at
least about a two-fold change, preferably at least about a three fold change, with at least about
five-fold or higher being preferred. All unigene cluster identification numbers and accession
numbers herein are for the GenBank sequence database and the sequences of the accession
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see,
e.g., Benson, DA, et al., Nucleic Acids Research 26:1-7 (1998) and
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g.,
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ).
U.S. Patent Application No. 09/687,576, with the same assignee as the present application,
which further discloses related sequences, compositions, and methods of diagnosis and
treatment of breast cancer, is hereby expressly incorporated by reference.

In another preferred embodiment, breast cancer sequences are those that are
down-regulated in the breast cancer; that is, the expression of these genes is lower in breast
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1, 2, 3, 15, 16 etc.).
"Down-regulation" as used herein often means at least about a two-fold change, preferably at
least about a three fold change, with at least about five-fold or higher being preferred.
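
The fold-change definitions of up- and down-regulation given above can be expressed as a small Python sketch. The function name and the use of a simple ratio of cancer to normal signal are assumptions for illustration; how expression signals are normalized before comparison is not addressed here.

```python
def regulation(cancer_level: float, normal_level: float, min_fold: float = 2.0) -> str:
    """Label a gene by the fold change between cancer and normal tissue.

    `min_fold` defaults to the two-fold change named in the text;
    3.0 or 5.0 can be passed for the more preferred thresholds.
    """
    if cancer_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    ratio = cancer_level / normal_level
    if ratio >= min_fold:
        return "up-regulated"
    if ratio <= 1.0 / min_fold:
        return "down-regulated"
    return "unchanged"


# Hypothetical signal intensities.
print(regulation(600.0, 200.0))  # three-fold higher in cancer
print(regulation(100.0, 500.0))  # five-fold lower in cancer
```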

Informatics

The ability to identify genes that are over or under expressed in breast cancer
can additionally provide high-resolution, high-sensitivity datasets which can be used in the
areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure,
biosensor development, and other related areas. For example, the expression profiles can be
used in diagnostic or prognostic evaluation of patients with breast cancer. Or as another
example, subcellular toxicological information can be generated to better direct drug structure
and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets, Mechanism,
and Function, paper presented at the IBC Proteomics conference, Coronado, CA (June 11-12,
1998)). Subcellular toxicological information can also be utilized in a biological sensor
device to predict the likely toxicological effect of chemical exposures and likely tolerable
exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue from
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides,
lipids, drugs, and the like).

Thus, in another embodiment, the present invention provides a database that
includes at least one set of assay data. The data contained in the database is acquired, e.g.,
using array analysis either singly or in a library format. The database can be in substantially
any form in which data can be maintained and transmitted, but is preferably an electronic
database. The electronic database of the invention can be maintained on any electronic
device allowing for the storage of and access to the database, such as a personal computer,
but is preferably distributed on a wide area network, such as the World Wide Web.

The focus of the present section on databases that include peptide sequence
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar
databases can be assembled for any assay data acquired using an assay of the invention.

The compositions and methods for identifying and/or quantitating the relative
and/or absolute abundance of a variety of molecular and macromolecular species from a
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated
sequences described herein, provide an abundance of information, which can be correlated
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring,
gene-disease causal linkages, identification of correlates of immunity and physiological
status, among others. Although the data generated from the assays of the invention is suited
for manual review and analysis, in a preferred embodiment, prior data processing using
high-speed computers is utilized.

An array of methods for indexing and retrieving biomolecular information is
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational
database system for storing biomolecular sequence information in a manner that allows
sequences to be catalogued and searched according to one or more protein function
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records
containing information in a format that allows a collection of partial-length DNA sequences
to be catalogued and searched according to association with one or more sequencing projects
for obtaining full-length sequences from the collection of partial length sequences. U.S.
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene
sequence similar to a sequence data item in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database
record are divided into two classes, navigational and informational data, with navigational
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount et al., Bioinformatics (2001); Biological Sequence Analysis:
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al., eds., 1999);
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis &
Ouellette eds., 1998); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological
Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et
al., eds., 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds., 2000);
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins &
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist's Guide to Biocomputing and the
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995).

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing
sample is from a control tissue sample known to be free of pathological disorders. In a
variation, at least one of the sources is a known pathological tissue specimen, e.g., a
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another
variation, the assay records cross-tabulate one or more of the following parameters for each
target species in a sample: (1) a unique identification code, which can include, e.g., a target
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species
present in the sample.
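
The three cross-tabulated parameters listed above map naturally onto a simple record type. The following Python sketch is one possible in-memory layout, not the database of the invention; the class name, field names, and sample values are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    """One assay record holding the three parameters named in the text."""
    target_id: str       # (1) unique identification code
    sample_source: str   # (2) sample source
    quantity: float      # (3) absolute or relative quantity in the sample


def cross_tabulate(records):
    """Build a target-by-source table: {target_id: {sample_source: quantity}}."""
    table = defaultdict(dict)
    for rec in records:
        table[rec.target_id][rec.sample_source] = rec.quantity
    return dict(table)


# Hypothetical records from a control and a pathological specimen.
records = [
    AssayRecord("T001", "control breast tissue", 1.0),
    AssayRecord("T001", "neoplastic lesion", 4.2),
    AssayRecord("T002", "neoplastic lesion", 0.3),
]
table = cross_tabulate(records)
print(table["T001"]["neoplastic lesion"])
```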

The invention also provides for the storage and retrieval of a collection of
target data in a computer data storage apparatus, which can include magnetic disks, optical
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM,
magnetic bubble memory devices, and other data storage devices, including CPU registers
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern
in an array of magnetic domains on a magnetizable medium or as an array of charge states or
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of
a transistor and a charge storage area, which may be on the transistor). In one embodiment,
the invention provides such storage devices, and computer systems built therewith,
comprising a bit pattern encoding a protein expression fingerprint record comprising unique
identifiers for at least 10 target data records cross-tabulated with target source.

When the target is a peptide or nucleic acid, the invention preferably provides
a method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in
or retrieved from a computer storage device or database and at least one other sequence. The
comparison can include a sequence analysis or comparison algorithm or computer program
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences
determined from a polypeptide or nucleic acid sample of a specimen.

The invention also preferably provides a magnetic disk, such as an
IBM-compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format
(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of
the invention in a file format suitable for retrieval and processing in a computerized sequence
analysis, comparison, or relative quantitation method.

The invention also provides a network, comprising a plurality of computing
devices linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line,
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium,
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM
cells) composing a bit pattern encoding data acquired from an assay of the invention.

The invention also provides a method for transmitting assay data that includes
generating an electronic signal on an electronic communications device, such as a modem,
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal
includes (in native or encrypted format) a bit pattern encoding data from an assay or a
database comprising a plurality of assay results obtained by the method of the invention.

In a preferred embodiment, the invention provides a computer system for
comparing a query target to a database containing an array of data structures, such as an assay
result obtained by the method of the invention, and ranking database targets based on the
degree of identity and gap weight to the target data. A central processor is preferably
initialized to load and execute the computer program for alignment and/or comparison of the
assay results. Data for a query target is entered into the central processor via an I/O device.
Execution of the computer program results in the central processor retrieving the assay data
from the data file, which comprises a binary description of an assay result.

The target data or record and the computer program can be transferred to
secondary memory, which is typically random access memory (e.g., DRAM, SRAM,
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the
same characteristic of the query target and results are output via an I/O device. For example,
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha,
PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or
public domain molecular biology software package (e.g., UWGCG Sequence Analysis
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory,
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem,
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or
other suitable I/O device.

The invention also preferably provides the use of a computer system, such as
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a
collection of peptide sequence specificity records obtained by the methods of the invention,
which may be stored in the computer; (3) a comparison target, such as a query target; and (4)
a program for alignment and comparison, typically with rank-ordering of comparison results
on the basis of computed similarity values.
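
The alignment-and-ranking program described above can be sketched with a basic global alignment score (Needleman-Wunsch style) in Python. This is a simplified stand-in, not the FASTA, GAP, or BESTFIT programs named in the text; the match, mismatch, and gap weights and the example sequences are hypothetical.

```python
def global_alignment_score(a: str, b: str,
                           match: int = 1, mismatch: int = -1, gap: int = -2) -> int:
    """Score the best global alignment of `a` and `b` with a linear gap weight."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def rank_targets(query: str, database: dict) -> list:
    """Rank database entries by similarity to the query, best first."""
    scored = [(name, global_alignment_score(query, seq)) for name, seq in database.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical stored sequence records.
database = {"rec1": "ACGT", "rec2": "AGT", "rec3": "TTTT"}
for name, score in rank_targets("ACGT", database):
    print(name, score)
```

A production system would typically use substitution matrices and affine gap penalties; the linear gap weight here is chosen only to keep the sketch short.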

Characteristics of breast cancer-associated proteins 

Breast cancer proteins of the present invention may be classified as secreted
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the breast
cancer protein is an intracellular protein. Intracellular proteins may be found in the
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular
function and replication (including, e.g., signaling pathways); aberrant expression of such
proteins often results in unregulated or disregulated cellular processes (see, e.g., Molecular
Biology of the Cell (Alberts, ed., 3rd ed., 1994)). For example, many intracellular proteins
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins
also serve as docking proteins that are involved in organizing complexes of proteins, or
targeting proteins to various subcellular localizations, and are involved in maintaining the
structural integrity of organelles.

An increasingly appreciated concept in characterizing proteins is the presence
in the proteins of one or more motifs for which defined functions have been attributed. In
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly
conserved sequences have been identified in proteins that are involved in protein-protein
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated
targets in a sequence dependent manner. PTB domains, which are distinct from SH2
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich
targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a
few, have been shown to mediate protein-protein interactions. Some of these may also be
involved in binding to phospholipids or other second messengers. As will be appreciated by
one of ordinary skill in the art, these motifs can be identified on the basis of primary
sequence; thus, an analysis of the sequence of proteins may provide insight into both the
enzymatic potential of the molecule and/or molecules with which the protein may associate.
One useful database is Pfam (protein families), which is a large collection of multiple
sequence alignments and hidden Markov models covering many common protein domains.
Versions are available via the internet from Washington University in St. Louis, the Sanger
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman et al., Nuc.
Acids Res. 28:263-266 (2000); Sonnhammer et al., Proteins 28:405-420 (1997); Bateman et
al., Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et al., Nuc. Acids Res. 26:320-322
(1998)).

In another embodiment, the breast cancer sequences are transmembrane
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell.
They may have an intracellular domain, an extracellular domain, or both. The intracellular
domains of such proteins may have a number of functions including those already described
for intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2 domain
containing proteins.

Transmembrane proteins may contain from one to many transmembrane
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor
guanylyl cyclases and receptor serine/threonine protein kinases contain a single
transmembrane domain. However, various other proteins including channels and adenylyl
cyclases contain numerous transmembrane domains. Many important cell surface receptors
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of
transmembrane domains include approximately 20 consecutive hydrophobic amino acids that
may be followed by charged amino acids. Therefore, upon analysis of the amino acid
sequence of a particular protein, the localization and number of transmembrane domains
within the protein may be predicted (see, e.g., PSORT web site http://psort.nibb.ac.jp/).
Important transmembrane protein receptors include, but are not limited to, the insulin
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose
transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein
receptor, leptin receptor, and interleukin receptors, e.g., IL-1 receptor and
IL-2 receptor.
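
The sequence-based prediction described above (a run of roughly 20 consecutive hydrophobic residues) can be illustrated with a minimal sliding-window sketch. This is not the PSORT method. It assumes the published Kyte-Doolittle hydropathy scale, and the function name, the 20-residue window and the 1.6 cutoff are illustrative assumptions that would need tuning against real data.

```python
# Kyte-Doolittle hydropathy values; the scale is an assumption of this
# sketch and is not named in the text above.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def candidate_tm_segments(seq, window=20, threshold=1.6):
    """Return merged (start, end) spans, 0-based and end-exclusive, covered
    by windows whose mean hydropathy is at least `threshold`.

    `window` and `threshold` are hypothetical defaults for illustration.
    """
    seq = seq.upper()
    spans = []
    for i in range(len(seq) - window + 1):
        # Unknown residue codes score 0.0 rather than raising an error.
        mean = sum(KYTE_DOOLITTLE.get(aa, 0.0) for aa in seq[i:i + window]) / window
        if mean >= threshold:
            if spans and i <= spans[-1][1]:
                # Overlaps or touches the previous span: extend it.
                spans[-1] = (spans[-1][0], i + window)
            else:
                spans.append((i, i + window))
    return spans


if __name__ == "__main__":
    toy = "D" * 20 + "L" * 20 + "K" * 20  # hydrophobic core, charged flanks
    print(candidate_tm_segments(toy))
```

Each reported span is a candidate only; because windows overlap, a span typically extends a few residues into the flanking sequence, and the number of spans is a rough estimate of the number of membrane-spanning regions.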

The extracellular domains of transmembrane proteins are diverse; however,
conserved motifs are found repeatedly among various extracellular domains. Conserved
structure and/or functions have been ascribed to different extracellular motifs. Many
extracellular domains are involved in binding to other molecules. In one aspect, extracellular
domains are found on receptors. Factors that bind the receptor domain include circulating
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like.
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions.
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI)
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate
with the extracellular matrix and contribute to the maintenance of the cell structure.

Breast cancer proteins that are transmembrane are particularly preferred in the
present invention as they are readily accessible targets for immunotherapeutics, as are
described herein. In addition, as outlined below, transmembrane proteins can also be useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are
typically permeabilized to provide access to intracellular proteins.

It will also be appreciated by those in the art that a transmembrane protein can
be made soluble by removing transmembrane sequences, e.g., through recombinant methods.
Furthermore, transmembrane proteins that have been made soluble can be made to be
secreted through recombinant means by adding an appropriate signal sequence.

In another embodiment, the breast cancer proteins are secreted proteins, the
secretion of which can be either constitutive or regulated. These proteins have a signal
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted
proteins are involved in numerous physiological events; by virtue of their circulating nature,
they serve to transmit signals to various other cell types. The secreted protein may function in
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous
aspects of physiology. Breast cancer proteins that are secreted proteins are particularly
preferred in the present invention as they serve as good targets for diagnostic markers, e.g.,
for blood, plasma, serum, or stool tests.

Use of breast cancer nucleic acids 

As described above, a breast cancer sequence is initially identified by
substantial nucleic acid and/or amino acid sequence homology or linkage to the breast cancer
sequences outlined herein. Such homology can be based upon the overall nucleic acid or
amino acid sequence, and is generally determined as outlined below, using either homology
programs or hybridization conditions. Typically, linked sequences on an mRNA are found on
the same molecule.

The breast cancer nucleic acid sequences of the invention, e.g., the sequences
in Tables 1-25, can be fragments of larger genes, i.e., they are nucleic acid segments.
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the
sequences provided herein, extended sequences, in either direction, of the breast cancer genes
can be obtained, using techniques well known in the art for cloning either longer sequences or
the full length sequences; see Ausubel, et al., supra. Much can be done by informatics and
many sequences can be clustered to include multiple sequences corresponding to a single
gene, e.g., systems such as UniGene (see http://www.ncbi.nlm.nih.gov/UniGene/).

Once the breast cancer nucleic acid is identified, it can be cloned and, if
necessary, its constituent parts recombined to form the entire breast cancer nucleic acid
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g.,
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid
segment, the recombinant breast cancer nucleic acid can be further used as a probe to identify
and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be
used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic acids
and proteins.

The breast cancer nucleic acids of the present invention are used in several
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made
and attached to biochips to be used in screening and diagnostic methods, as outlined below,
or for administration, e.g., for gene therapy, vaccine, and/or antisense applications.
Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer
proteins can be put into expression vectors for the expression of breast cancer proteins, again
for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are
made. The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the breast cancer nucleic acids, the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be any number of base
pair mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under normal reaction conditions, particularly high stringency
conditions, as outlined herein.

A nucleic acid probe is generally single stranded but can be partially single
and partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally whole
genes are not used. In some embodiments, much longer nucleic acids can be used, up to
hundreds of bases.

In a preferred embodiment, more than one probe per sequence is used, with
either overlapping probes or probes to different sections of the target being used. That is,
two, three, four or more probes, with three being preferred, are used to build in a redundancy
for a particular target. The probes can be overlapping (i.e., have some sequence in common),
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
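
As a rough illustration of building redundant probes for one target, the sketch below tiles a small number of fixed-length probes evenly along a target sequence and reports each as the reverse complement of its window. It assumes perfect complementarity and a DNA alphabet of A, C, G and T only. The function names, the 40-base probe length and the even spacing are illustrative choices, not taken from the specification.

```python
# Translation table for complementing DNA bases (A<->T, C<->G).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def tile_probes(target, probe_len=40, n_probes=3):
    """Return a list of (start, probe) pairs.

    Each probe is the reverse complement of target[start:start + probe_len].
    Starts are spread evenly from the first to the last possible window, so
    probes overlap whenever n_probes * probe_len exceeds len(target).
    """
    if n_probes < 1:
        raise ValueError("n_probes must be at least 1")
    if probe_len > len(target):
        raise ValueError("probe_len exceeds target length")
    last_start = len(target) - probe_len
    if n_probes == 1:
        starts = [0]
    else:
        starts = [round(k * last_start / (n_probes - 1)) for k in range(n_probes)]
    return [(s, reverse_complement(target[s:s + probe_len])) for s in starts]


if __name__ == "__main__":
    target = "ACGT" * 25  # hypothetical 100-base target
    for start, probe in tile_probes(target):
        print(start, probe)
```

With a 100-base target, 40-base probes and three probes, the windows start at positions 0, 30 and 60, so adjacent probes share 10 bases. Choosing a target longer than the combined probe length gives separate, non-overlapping probes instead.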

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant the association or binding between the nucleic acid probe and the
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and
removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is meant one or more of electrostatic,
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent
attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical
equivalents herein is meant that the two moieties, the solid support and the probe, are
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds.
Covalent bonds can be formed directly between the probe and the solid support or can be
formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as
will be appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid
support" or other grammatical equivalents herein is meant a material that can be modified to
contain discrete individual sites appropriate for the attachment or association of the nucleic
acid probes and is amenable to at least one detection method. As will be appreciated by those
in the art, the number of possible substrates is very large; they include, but are not limited to,
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene,
polyurethanes, Teflon, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or
silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses,
plastics, etc. In general, the substrates allow optical detection and do not appreciably
fluoresce. A preferred substrate is described in copending application entitled Reusable Low
Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999,
herein incorporated by reference in its entirety.

Generally the substrate is planar, although as will be appreciated by those in
the art, other configurations of substrates may be used as well. For example, the probes may
be placed on the inside surface of a tube, for flow-through sample analysis to minimize
sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including
closed cell foams made of particular plastics.

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art, e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized as is known in the art,
and then attached to the surface of the solid support. As will be appreciated by those skilled
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may
be via an internal nucleoside.

In another embodiment, the immobilization to the solid support may be very
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which
bind to surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is
known in the art. For example, photoactivation techniques utilizing photopolymerization
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be
synthesized in situ, using well known photolithographic techniques, such as those described
in WO 95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references
cited within, all of which are expressly incorporated by reference; these methods of
attachment form the basis of the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression
level of breast cancer-associated sequences. These assays are typically performed in
conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid
sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or
PCR). In a quantitative amplification, the amount of amplification product will be
proportional to the amount of template in the original sample. Comparison to appropriate
controls provides a measure of the amount of breast cancer-associated RNA. Methods of
quantitative amplification are well known to those of skill in the art. Detailed protocols for
quantitative PCR are provided, e.g., in Innis et al., PCR Protocols, A Guide to Methods and
Applications (1990).
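
The proportionality between product and starting template can be made concrete with a small numerical sketch. It assumes an idealized exponential model in which every cycle multiplies the product by (1 + efficiency), and it compares a sample to a control through the cycle number at which each reaches a fixed signal threshold. Neither the model nor the threshold-cycle comparison is spelled out in the text above, and the function and parameter names are hypothetical.

```python
def amplicon_copies(template_copies, cycles, efficiency=1.0):
    """Copies of product after `cycles` rounds of idealized amplification.

    efficiency=1.0 means perfect doubling each cycle. Real reactions fall
    short of this and plateau in late cycles, which this model ignores.
    """
    return template_copies * (1.0 + efficiency) ** cycles


def relative_template(ct_sample, ct_control, efficiency=1.0):
    """Estimated ratio of starting template, sample over control.

    ct_sample and ct_control are the cycle numbers at which each reaction
    crosses the same signal threshold; reaching it earlier implies more
    starting template.
    """
    return (1.0 + efficiency) ** (ct_control - ct_sample)


if __name__ == "__main__":
    print(amplicon_copies(100, 10))   # product scales linearly with template
    print(relative_template(22, 25))  # sample crosses threshold 3 cycles early
```

Under these assumptions, product at a given cycle is directly proportional to starting template, and a sample that crosses the threshold three cycles before the control is estimated to contain eight times as much template.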

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com).

Other suitable amplification methods include, but are not limited to, ligase
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al., Science
241:1077 (1988), and Barringer et al., Gene 89:117 (1990)), transcription amplification
(Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173 (1989)), self-sustained sequence replication
(Guatelli et al., Proc. Natl. Acad. Sci. USA 87:1874 (1990)), dot PCR, and linker adapter PCR,
etc.

Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast
cancer proteins, are used to make a variety of expression vectors to express breast cancer
proteins which can then be used in screening assays, as described below. Expression vectors
and recombinant DNA technology are well known to those of skill in the art (see, e.g.,
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds., 1999)) and are
used to express proteins. The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these
expression vectors include transcriptional and translational regulatory nucleic acid operably
linked to the nucleic acid encoding the breast cancer protein. The term "control sequences"
refers to DNA sequences used for the expression of an operably linked coding sequence in a
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are
known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional
relationship with another nucleic acid sequence. For example, DNA for a presequence or
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site
is operably linked to a coding sequence if it is positioned so as to facilitate translation.
Generally, "operably linked" means that the DNA sequences being linked are contiguous,
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers
do not have to be contiguous. Linking is typically accomplished by ligation at convenient
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are
used in accordance with conventional practice. Transcriptional and translational regulatory
nucleic acid will generally be appropriate to the host cell used to express the breast cancer
protein. Numerous types of appropriate expression vectors, and suitable regulatory
sequences, are known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include,
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and
stop sequences, translational start and stop sequences, and enhancer or activator sequences.
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional
start and stop sequences.

Promoter sequences encode either constitutive or inducible promoters. The
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid
promoters, which combine elements of more than one promoter, are also known in the art,
and are useful in the present invention.

In addition, an expression vector may comprise additional elements. For
example, the expression vector may have two replication systems, thus allowing it to be
maintained in two organisms, e.g., in mammalian or insect cells for expression and in a
prokaryotic host for cloning and amplification. Furthermore, for integrating expression
vectors, the expression vector contains at least one sequence homologous to the host cell
genome, and preferably two homologous sequences which flank the expression construct.
The integrating vector may be directed to a specific locus in the host cell by selecting the
appropriate homologous sequence for inclusion in the vector. Constructs for integrating
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra).

In addition, in a preferred embodiment, the expression vector contains a
selectable marker gene to allow the selection of transformed host cells. Selection genes are
well known in the art and will vary with the host cell used.

The breast cancer proteins of the present invention are produced by culturing a
host cell transformed with an expression vector containing nucleic acid encoding a breast
cancer protein, under the appropriate conditions to induce or cause expression of the breast
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the
choice of the expression vector and the host cell, and will be easily ascertained by one skilled
in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculoviral systems used in insect cell expression are lytic
viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect
and animal cells, including mammalian cells. Of particular interest are Saccharomyces
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells,
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the breast cancer proteins are expressed in
mammalian cells. Mammalian expression systems are also known in the art, and include
retroviral and adenoviral systems. One expression vector system is a retroviral vector system
such as is generally described in PCT/US97/01019 and PCT/US97/01048, both of which are
hereby expressly incorporated by reference. Of particular use as mammalian promoters are
the promoters from mammalian viral genes, since the viral genes are often highly expressed
and have a broad host range. Examples include the SV40 early promoter, mouse mammary
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter,
and the CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription
termination and polyadenylation sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus, together with the promoter elements,
flank the coding sequence. Examples of transcription terminator and polyadenylation signals
include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as
well as other hosts, are well known in the art, and will vary with the host cell used.
Techniques include dextran-mediated transfection, calcium phosphate precipitation,
polybrene mediated transfection, protoplast fusion, electroporation, viral infection,
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA
into nuclei.

In a preferred embodiment, breast cancer proteins are expressed in bacterial
systems. Bacterial expression systems are well known in the art. Promoters from
bacteriophage may also be used and are known in the art. In addition, synthetic promoters
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome
binding site is desirable. The expression vector may also include a signal peptide sequence
that provides for secretion of the breast cancer protein in bacteria. The protein is either
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial
expression vector may also include a selectable marker gene to allow for the selection of
bacterial strains that have been transformed. Suitable selection genes include genes which
render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin,
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes,
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These
components are assembled into expression vectors. Expression vectors for bacteria are well
known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris,
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial
expression vectors are transformed into bacterial host cells using techniques well known in
the art, such as calcium chloride treatment, electroporation, and others.

In one embodiment, breast cancer proteins are produced in insect cells.
Expression vectors for the transformation of insect cells, and in particular, baculovirus-based
expression vectors, are well known in the art.

In a preferred embodiment, breast cancer protein is produced in yeast cells.
Yeast expression systems are well known in the art, and include expression vectors for
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and lactis, Pichia guillerimondii and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The breast cancer protein may also be made as a fusion protein, using
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an
immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to
increase expression, or for other reasons. For example, when the breast cancer protein is a
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic
acid for expression purposes.

In a preferred embodiment, the breast cancer protein is purified or isolated
after expression. Breast cancer proteins may be isolated or purified in a variety of ways
known to those skilled in the art depending on what other components are present in the
sample. Standard purification methods include electrophoretic, molecular, immunological
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the breast cancer protein
may be purified using a standard anti-breast cancer protein antibody column. Ultrafiltration
and diafiltration techniques, in conjunction with protein concentration, are also useful. For
general guidance in suitable purification techniques, see Scopes, Protein Purification (1982).
The degree of purification necessary will vary depending on the use of the breast cancer
protein. In some instances no purification will be necessary.

Once expressed and purified if necessary, the breast cancer proteins and
nucleic acids are useful in a number of applications. They may be used as immunoselection
reagents, as vaccine reagents, as screening agents, etc.

Variants of breast cancer proteins 

In one embodiment, the breast cancer proteins are derivative or variant breast
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below,
the derivative breast cancer peptide will often contain at least one amino acid substitution,
deletion or insertion, with amino acid substitutions being particularly preferred. The amino
acid substitution, insertion or deletion may occur at any residue within the breast cancer
peptide.

Also included within one embodiment of breast cancer proteins of the present
invention are amino acid sequence variants. These variants typically fall into one or more of
three classes: substitutional, insertional or deletional variants. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell
culture as outlined above. However, variant breast cancer protein fragments having up to
about 100-150 residues may be prepared by in vitro synthesis using established techniques.
Amino acid sequence variants are characterized by the predetermined nature of the variation,
a feature that sets them apart from naturally occurring allelic or interspecies variation of the
breast cancer protein amino acid sequence. The variants typically exhibit the same
qualitative biological activity as the naturally occurring analogue, although variants can also
be selected which have modified characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is
predetermined, the mutation per se need not be predetermined. For example, in order to
optimize the performance of a mutation at a given site, random mutagenesis may be
conducted at the target codon or region and the expressed breast cancer variants screened for
the optimal combination of desired activity. Techniques for making substitution mutations at
predetermined sites in DNA having a known sequence are well known, e.g., M13 primer
mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of breast
cancer protein activities.

Amino acid substitutions are typically of single residues; insertions usually
will be on the order of from about 1 to 20 amino acids, although considerably larger
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in
some cases deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to
arrive at a final derivative. Generally these changes are done on a few amino acids to
minimize the alteration of the molecule. However, larger changes may be tolerated in certain
circumstances. When small alterations in the characteristics of the breast cancer protein are
desired, substitutions are generally made in accordance with the amino acid substitution
relationships provided in the definition section.

The variants typically exhibit the same qualitative biological activity and will
elicit the same immune response as the naturally-occurring analog, although variants also are
selected to modify the characteristics of the breast cancer proteins as needed. Alternatively,
the variant may be designed such that the biological activity of the breast cancer protein is
altered. For example, glycosylation sites may be altered or removed.

Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure;
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain.
The substitutions which in general are expected to produce the greatest changes in the
polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue
having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by)
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine.
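Rules (a) through (d) above can be illustrated with a short sketch. The residue sets below contain only the examples named in the text, written as one-letter codes; the function name and the set names are hypothetical, and the grouping is not an exhaustive classification of amino acids.

```python
# Residue groups hold only the examples named in rules (a)-(d) of the text.
HYDROPHILIC = {"S", "T"}                  # seryl, threonyl
HYDROPHOBIC = {"L", "I", "F", "V", "A"}   # leucyl, isoleucyl, phenylalanyl, valyl, alanyl
CYS_OR_PRO = {"C", "P"}                   # cysteine, proline
ELECTROPOSITIVE = {"K", "R", "H"}         # lysyl, arginyl, histidyl
ELECTRONEGATIVE = {"E", "D"}              # glutamyl, aspartyl
BULKY = {"F"}                             # phenylalanine
NO_SIDE_CHAIN = {"G"}                     # glycine


def _swapped(a, b, group1, group2):
    """True if one residue is in group1 and the other in group2 ("for (or by)")."""
    return (a in group1 and b in group2) or (a in group2 and b in group1)


def disruptive_rules(original, replacement):
    """Return the letters of rules (a)-(d) that a single substitution matches."""
    if original == replacement:
        return []
    rules = []
    if _swapped(original, replacement, HYDROPHILIC, HYDROPHOBIC):
        rules.append("a")
    if original in CYS_OR_PRO or replacement in CYS_OR_PRO:
        rules.append("b")
    if _swapped(original, replacement, ELECTROPOSITIVE, ELECTRONEGATIVE):
        rules.append("c")
    if _swapped(original, replacement, BULKY, NO_SIDE_CHAIN):
        rules.append("d")
    return rules
```

A substitution that returns an empty list matches none of the four rules as exemplified here; that does not by itself establish that the substitution is conservative.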

Covalent modifications of breast cancer polypeptides are included within the
scope of this invention. One type of covalent modification includes reacting targeted amino
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable
of reacting with selected side chains or the N- or C-terminal residues of a breast cancer
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking
breast cancer polypeptides to a water-insoluble support matrix or surface for use in the
method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more
fully described below. Commonly used crosslinking agents include, e.g., 1,1-
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton,
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N-
terminal amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the breast cancer polypeptide
included within the scope of this invention comprises altering the native glycosylation pattern
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes
herein to mean deleting one or more carbohydrate moieties found in native sequence breast
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many
ways. For example, the use of different cell types to express breast cancer-associated
sequences can result in different glycosylation patterns.

Addition of glycosylation sites to breast cancer polypeptides may also be
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g.,
by the addition of, or substitution by, one or more serine or threonine residues to the native
sequence breast cancer polypeptide (for O-linked glycosylation sites). The breast cancer
amino acid sequence may optionally be altered through changes at the DNA level,
particularly by mutating the DNA encoding the breast cancer polypeptide at preselected bases
such that codons are generated that will translate into the desired amino acids.

Another means of increasing the number of carbohydrate moieties on the
breast cancer polypeptide is by chemical or enzymatic coupling of glycosides to the
polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in Aplin &
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981).

Removal of carbohydrate moieties present on the breast cancer polypeptide
may be accomplished chemically or enzymatically or by mutational substitution of codons
encoding for amino acid residues that serve as targets for glycosylation. Chemical
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin,
et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the
use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth.
Enzymol., 138:350 (1987).

Another type of covalent modification of breast cancer polypeptides comprises linking the
breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.

Breast cancer polypeptides of the present invention may also be modified in a
way to form chimeric molecules comprising a breast cancer polypeptide fused to another,
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of a breast cancer polypeptide with a tag polypeptide which
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is
generally placed at the amino- or carboxyl-terminus of the breast cancer polypeptide. The
presence of such epitope-tagged forms of a breast cancer polypeptide can be detected using
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment,
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags;
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et
al., Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and
9E10 antibodies thereto (Evan et al., Molecular and Cellular Biology 5:3610-3616 (1985));
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al.,
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide
(Hopp et al., BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al.,
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol. Chem.
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al.,
Proc. Natl. Acad. Sci. USA 87:6393-6397 (1990)).

Also included are other breast cancer proteins of the breast cancer family, and
breast cancer proteins from other organisms, which are cloned and expressed as outlined
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be
used to find other related breast cancer proteins from humans or other organisms. As will be
appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from
about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra).
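The primer length ranges stated above can be checked with a small helper. This is only a sketch: the text says "about", so treating 15, 20, 30 and 35 as hard cutoffs is an assumption, as are the function name and the use of the letter I to denote inosine.

```python
# Bases accepted in a primer string; "I" stands for inosine (an assumed notation).
VALID_BASES = set("ACGTI")


def primer_length_class(primer):
    """Classify a primer by the length ranges given in the text.

    The text gives approximate ranges; the exact cutoffs used here are assumed.
    """
    seq = primer.upper()
    if not seq or set(seq) - VALID_BASES:
        raise ValueError("primer must be a non-empty string of A, C, G, T or I")
    length = len(seq)
    if 20 <= length <= 30:
        return "about 20 to about 30 (narrower preferred range)"
    if 15 <= length <= 35:
        return "about 15 to about 35 (preferred range)"
    return "outside the stated ranges"
```

For example, a 22-mer falls in the narrower range, while a 15-mer containing inosine falls only in the broader one.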

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to
generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein
should share at least one epitope or determinant with the full length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller breast cancer protein will be able to bind to the full-length protein,
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are known to the skilled artisan
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant.
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in
the mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art
without undue experimentation.

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal
antibodies may be prepared using hybridoma methods, such as those described by Kohler &
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other
appropriate host animal, is typically immunized with an immunizing agent to elicit
lymphocytes that produce or are capable of producing antibodies that will specifically bind to
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The
immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1-
25 or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph
node cells are used if non-human mammalian sources are desired. The lymphocytes are then
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene
glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice,
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells,
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture
medium that preferably contains one or more substances that inhibit the growth or survival of
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT
medium"), which substances prevent the growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific
antibodies are monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of Tables 1-25 or a fragment thereof, the other one is for any
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific. Alternatively, tetramer-type technology may create
multivalent reagents.

In a preferred embodiment, the antibodies to breast cancer protein are capable
of reducing or eliminating a biological function of a breast cancer protein, as is described
below. That is, the addition of anti-breast cancer protein antibodies (either polyclonal or
preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or
the like is preferred, with at least about 50% being particularly preferred and about a 95-
100% decrease being especially preferred.

In a preferred embodiment the antibodies to the breast cancer proteins are
humanized antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein
Design Labs, Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv,
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include
human immunoglobulins (recipient antibody) in which residues from a complementary
determining region (CDR) of the recipient are replaced by residues from a CDR of a non-
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity,
affinity and capacity. In some instances, Fv framework residues of the human
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies
may also comprise residues which are found neither in the recipient antibody nor in the
imported CDR or framework sequences. In general, a humanized antibody will comprise
substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin
and all or substantially all of the framework (FR) regions are those of a human
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise
at least a portion of an immunoglobulin constant region (Fc), typically that of a human
immunoglobulin (Jones et al., Nature 321:522-525 (1986); Riechmann et al., Nature
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992)). Humanization
can be essentially performed following the method of Winter and co-workers (Jones et al.,
Nature 321:522-525 (1986); Riechmann et al., Nature 332:323-327 (1988); Verhoeyen et al.,
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a non-
human species.

Human antibodies can also be produced using various techniques known in the
art, including phage display libraries (Hoogenboom & Winter, J. Mol. Biol. 227:381 (1991);
Marks et al., J. Mol. Biol. 222:581 (1991)). The techniques of Cole et al. and Boerner et al.
are also available for the preparation of human monoclonal antibodies (Cole et al.,
Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al., J. Immunol.
147(1):86-95 (1991)). Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge,
human antibody production is observed, which closely resembles that seen in humans in all
respects, including gene rearrangement, assembly, and antibody repertoire. This approach is
described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425;
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology 10:779-
783 (1992); Lonberg et al., Nature 368:856-859 (1994); Morrison, Nature 368:812-13
(1994); Fishwild et al., Nature Biotechnology 14:845-51 (1996); Neuberger, Nature
Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol. 13:65-93 (1995).

By immunotherapy is meant treatment of breast cancer with an antibody raised
against breast cancer proteins. As used herein, immunotherapy can be passive or active.
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient
(patient). Active immunization is the induction of antibody and/or T-cell responses in a
recipient (patient). Induction of an immune response is the result of providing the recipient
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the
art, the antigen may be provided by injecting a polypeptide against which antibodies are
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of
expressing the antigen and under conditions for expression of the antigen, leading to an
immune response.

In a preferred embodiment the breast cancer proteins against which antibodies
are raised are secreted proteins as described above. Without being bound by theory,
antibodies used for treatment bind and prevent the secreted protein from binding to its
receptor, thereby inactivating the secreted breast cancer protein.

In another preferred embodiment, the breast cancer protein to which
antibodies are raised is a transmembrane protein. Without being bound by theory, antibodies
used for treatment bind the extracellular domain of the breast cancer protein and prevent it
from binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane breast cancer protein. As will be
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non-
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the
breast cancer protein. The antibody is also an antagonist of the breast cancer protein.
Further, the antibody prevents activation of the transmembrane breast cancer protein. In one
aspect, when the antibody prevents the binding of other molecules to the breast cancer
protein, the antibody prevents growth of the cell. The antibody may also be used to target or
sensitize the cell to cytotoxic agents, including, but not limited to TNF-α, TNF-β, IL-1, INF-γ
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin,
methotrexate, and the like. In some instances the antibody belongs to a sub-type that
activates serum complement when complexed with the transmembrane protein thereby
mediating cytotoxicity or antibody-dependent cytotoxicity (ADCC). Thus, breast cancer is
treated by administering to a patient antibodies directed against the transmembrane breast
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or
otherwise provide means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector
moiety. The effector moiety can be any number of molecules, including labelling moieties
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect
the therapeutic moiety is a small molecule that modulates the activity of the breast cancer
protein. In another aspect the therapeutic moiety modulates the activity of molecules
associated with or in close proximity to the breast cancer protein. The therapeutic moiety
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity
associated with breast cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic
agent. In this method, targeting the cytotoxic agent to breast cancer tissue or cells results in
a reduction in the number of afflicted cells, thereby reducing symptoms associated with
breast cancer. Cytotoxic agents are numerous and varied and include, but are not limited to,
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against breast cancer
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to
the antibody. Targeting the therapeutic moiety to transmembrane breast cancer proteins not
only serves to increase the local concentration of therapeutic moiety in the breast cancer
afflicted area, but also serves to reduce deleterious side effects that may be associated with
the therapeutic moiety.

In another preferred embodiment, the breast cancer protein against which the
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a
nuclear localization signal.

The breast cancer antibodies of the invention specifically bind to breast cancer
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a
Kd of at least about 0.1 mM, more usually at least about 1 µM, preferably at least about 0.1
µM or better, and most preferably, 0.01 µM or better. Selectivity of binding is also
important.
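The affinity tiers in the definition of "specifically bind" can be written out as a lookup. The sketch below is illustrative only: it takes the second tier to be 1 µM (an assumption about the intended unit), expresses every threshold in mol/L, reads "or better" as a lower Kd, and uses hypothetical names throughout.

```python
# (upper Kd limit in mol/L, tier label), ordered from tightest to weakest binding.
# The 1e-6 entry assumes the second tier in the text is 1 micromolar.
KD_TIERS = [
    (1e-8, "most preferred"),  # 0.01 uM or better
    (1e-7, "preferred"),       # about 0.1 uM or better
    (1e-6, "more usual"),      # about 1 uM
    (1e-4, "specific"),        # about 0.1 mM
]


def binding_tier(kd_molar):
    """Return the tightest tier whose Kd limit the measured value meets."""
    if kd_molar <= 0:
        raise ValueError("Kd must be a positive concentration in mol/L")
    for limit, label in KD_TIERS:
        if kd_molar <= limit:
            return label
    return "not specific by these thresholds"
```

A measured Kd of 50 nM (5e-8 mol/L), for instance, lands in the "preferred" tier. The passage also notes that selectivity matters, which a Kd lookup alone does not capture.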

Detection of breast cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide
expression profiles. An expression profile of a particular cell state or point of development is
essentially a "fingerprint" of the state. While two states may have any particular gene
similarly expressed, the evaluation of a number of genes simultaneously allows the
generation of a gene expression profile that is reflective of the state of the cell. By comparing
expression profiles of cells in different states, information regarding which genes are
important (including both up- and down-regulation of genes) in each of these states is
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue
sample has the gene expression profile of normal or cancerous tissue. This will provide for
molecular diagnosis of related conditions.

"Differential expression," or grammatical equivalents as used herein, refers to
qualitative or quantitative differences in the temporal and/or cellular gene expression
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state,
relative to another state thus permitting comparison of two or more states. A qualitatively
regulated gene will exhibit an expression pattern within a state or cell type which is
detectable by standard techniques. Some genes will be expressed in one state or cell type, but
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in
an increased amount of transcript, or downregulated, resulting in a decreased amount of
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996),
hereby expressly incorporated by reference. Other techniques include, but are not limited to,
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined
above, preferably the change in expression (i.e., upregulation or downregulation) is at least
about 50%, more preferably at least about 100%, more preferably at least about 150%, more
preferably at least about 200%, with from 300 to at least 1000% being especially preferred.
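The percentage thresholds above can be made concrete with a short calculation. The text does not give a formula, so this sketch assumes one: the change is measured relative to the lower of the two expression values, which lets upregulation and downregulation be scored on the same scale. The function names and that convention are hypothetical.

```python
def expression_change(reference, test):
    """Return (direction, percent change) between two positive expression levels.

    Assumed convention: percent change is taken relative to the lower value,
    so a 4-fold increase and a 4-fold decrease both score 300%.
    """
    if reference <= 0 or test <= 0:
        raise ValueError("expression levels must be positive")
    low, high = min(reference, test), max(reference, test)
    percent = 100 * (high - low) / low
    if test > reference:
        direction = "up"
    elif test < reference:
        direction = "down"
    else:
        direction = "unchanged"
    return direction, percent


def preference_tier(percent):
    """Map a percent change onto the thresholds listed in the text."""
    if percent >= 300:
        return "especially preferred (300% or more)"
    for cutoff in (200, 150, 100, 50):
        if percent >= cutoff:
            return f"at least about {cutoff}%"
    return "below the stated thresholds"
```

With a reference level of 100 units and a test level of 150 units, `expression_change` reports an upregulation of 50%, which meets the lowest threshold only.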

Evaluation may be at the gene transcript, or the protein level. The amount of
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the
final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be
evaluated in a breast cancer diagnostic test.

In a preferred embodiment, gene expression monitoring is performed
simultaneously on a number of genes. Multiple protein expression monitoring can be
performed as well. Similarly, these assays may be performed on an individual basis as well.

In this embodiment, the breast cancer nucleic acid probes are attached to
biochips as outlined herein for the detection and quantification of breast cancer sequences in
a particular cell. The assays are further described below in the example. PCR techniques can
be used to provide greater sensitivity.

In a preferred embodiment nucleic acids encoding the breast cancer protein are
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding a breast cancer protein is
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is
complementary to and hybridizes with the mRNA and includes, but is not limited to,
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be
examined on a solid support such as nylon membranes and hybridizing the probe with the
sample. Following washing to remove the non-specifically bound probe, the label is
detected. In another method detection of the mRNA is performed in situ. In this method
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following
washing to remove the non-specifically bound probe, the label is detected. For example, a
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-
indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins
as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells
containing breast cancer sequences are used in diagnostic assays. This can be performed on
an individual gene or corresponding polypeptide level. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes and/or corresponding
polypeptides.

As described and defined herein, breast cancer proteins, including
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer.
Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis
of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A
preferred method separates proteins from a sample by electrophoresis on a gel (typically a
denaturing and reducing protein gel, but may be another type of gel, including isoelectric
focusing gels and the like). Following separation of proteins, the breast cancer protein is
detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein.
Methods of immunoblotting are well known to those of ordinary skill in the art.

In another preferred method, antibodies to the breast cancer protein find use in
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one
to many antibodies to the breast cancer protein(s). Following washing to remove non-
specific antibody binding, the presence of the antibody or antibodies is detected. In one
embodiment the antibody is detected by incubating with a secondary antibody that contains a
detectable label. In another method the primary antibody to the breast cancer protein(s)
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another
preferred embodiment each one of multiple primary antibodies contains a distinct and
detectable label. This method finds particular use in simultaneous screening for a plurality of
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other
histological imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the
ability to detect and distinguish emissions of different wavelengths. In addition, a
fluorescence activated cell sorter (FACS) can be used in the method.

In another preferred embodiment, antibodies find use in diagnosing breast
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are
useful as samples to be probed or tested for the presence of breast cancer proteins.
Antibodies can be used to detect a breast cancer protein by previously described
immunoassay techniques including ELISA, immunoblotting (western blotting),
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of
antibodies may indicate an immune response against an endogenous breast cancer protein.

In a preferred embodiment, in situ hybridization of labeled breast cancer
nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including
breast cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel,
supra) is then performed. When comparing the fingerprints between an individual and a
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the
findings. It is further understood that the genes which indicate the diagnosis may differ from
those which indicate the prognosis and molecular profiling of the condition of the cells may
lead to distinctions between responsive or refractory conditions or may be predictive of
outcomes.

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic
acids, modified proteins and cells containing breast cancer sequences are used in prognosis
assays. As above, gene expression profiles can be generated that correlate to breast cancer,
in terms of long term prognosis. Again, this may be done on either a protein or gene level,
with the use of genes being preferred. As above, breast cancer probes may be attached to
biochips for the detection and quantification of breast cancer sequences in a tissue or patient.
The assays proceed as outlined above for diagnosis. A PCR method may provide more
sensitive and accurate quantification.

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and
antibodies as described herein are used in drug screening assays. The breast cancer proteins,
antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are
used in drug screening assays or by evaluating the effect of drug candidates on a "gene
expression profile" or expression profile of polypeptides. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes after treatment with a candidate
agent (e.g., Zlokarnik, et al., Science 279:84-8 (1998); Heid, Genome Res 6:986-94 (1996)).

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic
acids, modified proteins and cells containing the native or modified breast cancer proteins are
used in screening assays. That is, the present invention provides novel methods for screening
for compositions which modulate the breast cancer phenotype or an identified physiological
function of a breast cancer protein. As above, this can be done on an individual gene level or
by evaluating the effect of drug candidates on a "gene expression profile". In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent, see Zlokarnik, supra.
Having identified the differentially expressed genes herein, a variety of assays
may be executed. In a preferred embodiment, assays may be run on an individual gene or
protein level. That is, having identified a particular gene as up regulated in breast cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal versus tissue undergoing breast cancer, with changes of at least
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in
expression to be induced by the test compound.
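
The target values described above follow from reciprocal arithmetic. The following is a minimal sketch only, assuming that expression is summarized as a ratio of breast cancer tissue to normal tissue; the function name and the ratio convention are illustrative assumptions and are not part of the described method.

```python
def target_fold_change(cancer_to_normal_ratio):
    """Return the fold change a test compound would need to induce in order
    to bring expression back to the level seen in normal tissue.

    Assumed convention: a ratio of 4.0 means a 4-fold increase in breast
    cancer tissue; a ratio of 0.1 means a 10-fold decrease.
    """
    if cancer_to_normal_ratio <= 0:
        raise ValueError("expression ratio must be positive")
    return 1.0 / cancer_to_normal_ratio


# A gene that is 4-fold up in breast cancer tissue calls for a four-fold decrease.
print(target_fold_change(4.0))
# A gene that is 10-fold down calls for a ten-fold increase.
print(target_fold_change(0.1))
```

A result below 1 corresponds to a desired decrease in expression and a result above 1 to a desired increase.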

The amount of gene expression may be monitored using nucleic acid probes
and the quantification of gene expression levels, or, alternatively, the gene product itself can
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.
In this embodiment, the breast cancer nucleic acid probes are attached to
biochips as outlined herein for the detection and quantification of breast cancer sequences in
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates,
may be used with dispensed primers in desired wells. A PCR reaction can then be performed
and analyzed for each well.

Expression monitoring can be performed to identify compounds that modify
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide
sequence set out in Table 17. Generally, in a preferred embodiment, a test modulator is added
to the cells prior to analysis. Moreover, screens are also provided to identify agents that
modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or
interfere with the binding of a breast cancer protein and an antibody or other binding partner.

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence,
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter
expression profiles, or expression profile nucleic acids or proteins provided herein. In one
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue
fingerprint. In another embodiment, a modulator induces a breast cancer phenotype.
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations
to obtain a differential response to the various concentrations. Typically, one of these
concentrations serves as a negative control, i.e., at zero concentration or below the level of
detection.
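
The differential response across concentrations can be expressed relative to the negative control. The following is a minimal sketch only, assuming that each assay mixture yields a single positive signal value and that the zero-concentration mixture is the negative control; the function name, the units and the example numbers are hypothetical.

```python
def differential_response(signal_by_concentration):
    """Express each assay signal as a fraction of the negative-control signal.

    `signal_by_concentration` maps agent concentration to measured signal and
    must include the zero-concentration negative control (key 0.0).
    """
    control = signal_by_concentration[0.0]
    if control <= 0:
        raise ValueError("negative-control signal must be positive")
    return {
        concentration: signal / control
        for concentration, signal in sorted(signal_by_concentration.items())
        if concentration != 0.0
    }


# Hypothetical readings from assay mixtures run in parallel.
readings = {0.0: 100.0, 1.0: 80.0, 10.0: 50.0}
for concentration, relative in differential_response(readings).items():
    print(concentration, relative)
```

Values below 1 indicate a signal lower than the negative control and values above 1 a higher signal.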

Drug candidates encompass numerous chemical classes, though typically they
are organic molecules, preferably small organic compounds having a molecular weight of
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise
functional groups necessary for structural interaction with proteins, particularly hydrogen
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group,
preferably at least two of the functional chemical groups. The candidate agents often
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic
structures substituted with one or more of the above functional groups. Candidate agents are
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines,
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred
are peptides.

In one aspect, a modulator will neutralize the effect of a breast cancer protein.
By "neutralize" is meant that activity of a protein, and the consequent effect on the cell, is
inhibited or blocked.

In certain embodiments, combinatorial libraries of potential modulators will be
screened for an ability to bind to a breast cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve
providing a library containing a large number of potential therapeutic compounds (candidate
compounds). Such "combinatorial chemical libraries" are then screened in one or more
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical
compounds generated by either chemical synthesis or biological synthesis by combining a
number of chemical "building blocks" such as reagents. For example, a linear combinatorial
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of
chemical building blocks called amino acids in every possible way for a given compound
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical
compounds can be synthesized through such combinatorial mixing of chemical building
blocks (Gallop et al., J. Med. Chem. 37(9):1233-1251 (1994)).
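
The size of such a linear library grows as the number of building blocks raised to the compound length. The following is a minimal sketch only, assuming the 20 standard amino acids written as one-letter codes; the function names are illustrative, and the enumeration is practical only for short lengths.

```python
from itertools import product

# The 20 standard amino acids as one-letter codes (an assumption of this
# sketch; a real library may use other or additional building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length, building_blocks=AMINO_ACIDS):
    """Number of distinct linear compounds of the given length."""
    return len(building_blocks) ** length


def enumerate_library(length, building_blocks=AMINO_ACIDS):
    """Yield every linear combination of building blocks of the given length."""
    for combination in product(building_blocks, repeat=length):
        yield "".join(combination)


# Pentapeptides already number in the millions.
print(library_size(5))
# A tiny two-block alphabet makes the full enumeration easy to inspect.
print(list(enumerate_library(2, "AC")))
```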

Preparation and screening of combinatorial chemical libraries is well known to
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept. Prot. Res. 37:487-493
(1991), Houghton et al., Nature 354:84-88 (1991)), peptoids (PCT Publication No. WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc.
114:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding
(Hirschmann et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses
of small compound libraries (Chen et al., J. Amer. Chem. Soc. 116:2661 (1994)),
oligocarbamates (Cho, et al., Science 261:1303 (1993)), and/or peptidyl phosphonates
(Campbell et al., J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al., J. Med. Chem.
37:1385 (1994), nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature
Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see,
e.g., Liang et al., Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993);
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No.
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds,
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like).

Devices for the preparation of combinatorial libraries are commercially
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony,
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore,
Bedford, MA).

A number of well known robotic systems have also been developed for
solution phase chemistries. These systems include automated workstations like the
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka,
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation,
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual
synthetic operations performed by a chemist. Any of the above devices are suitable for use
with the present invention. The nature and implementation of modifications to these devices
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the
relevant art. In addition, numerous combinatorial libraries are themselves commercially
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, RU, Tripos, Inc., St. Louis,
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences,
Columbia, MD, etc.).

The assays to identify modulators are amenable to high throughput screening.
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription,
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of
polypeptide activity.

High throughput assays for the presence, absence, quantification, or other
properties of particular nucleic acids or protein products are well known to those of skill in
the art. Binding assays and reporter gene assays are similarly well known. Thus,
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins,
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high
throughput methods of screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems
typically automate entire procedures, including all sample and reagent pipetting, liquid
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate
for the assay. These configurable systems provide high throughput and rapid start up as well
as a high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In
this way libraries of proteins may be made for screening in the methods of the invention.
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and
mammalian proteins, with the latter being preferred, and human proteins being especially
preferred. Particularly useful test compounds will be directed to the class of proteins to which
the target belongs, e.g., substrates for enzymes or ligands and receptors.

In a preferred embodiment, modulators are peptides of from about 5 to about
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7
to about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins as is outlined above, random peptides, or "biased" random peptides. By
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide
consists of essentially random nucleotides and amino acids, respectively. Since generally
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they
may incorporate any nucleotide or amino acid at any position. The synthetic process can be
designed to generate randomized proteins or nucleic acids, to allow the formation of all or
most of the possible combinations over the length of the sequence, thus forming a library of
randomized candidate bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence
preferences or constants at any position. In a preferred embodiment, the library is biased.
That is, some positions within the sequence are either held constant, or are selected from a
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids,
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of
nucleic acid binding domains, the creation of cysteines for cross-linking, prolines for SH-3
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to
purines, etc.
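
The difference between a fully randomized and a biased library can be illustrated computationally. The following is a minimal sketch only, assuming that a library design is written as one string of permitted residues per position; the template shown, the function name and the seed are hypothetical and are not taken from the description.

```python
import random

ALL_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Hypothetical biased design: position 1 held constant as cysteine, position 3
# restricted to serine, threonine or tyrosine, position 5 held constant as
# proline, and the remaining positions fully randomized.
TEMPLATE = ["C", ALL_RESIDUES, "STY", ALL_RESIDUES, "P"]


def biased_peptide(position_choices, rng):
    """Draw one peptide, choosing each position from its permitted residues."""
    return "".join(rng.choice(choices) for choices in position_choices)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
library = [biased_peptide(TEMPLATE, rng) for _ in range(10)]
print(library)
```

A fully randomized library corresponds to a template in which every position permits all residues.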

Modulators of breast cancer can also be nucleic acids, as defined above.
As described above generally for proteins, nucleic acid modulating agents may
be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids.
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above
for proteins.

In a preferred embodiment, the candidate compounds are organic chemical
moieties, a wide variety of which are available in the literature.

After the candidate agent has been added and the cells allowed to incubate for
some period of time, the sample containing a target sequence to be analyzed is added to the
biochip. If required, the target sequence is prepared using known techniques. For example,
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc.,
with purification and/or amplification such as PCR performed as appropriate. For example,
an in vitro transcription with labels covalently attached to the nucleotides is performed.
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of
detecting the target sequence's specific binding to a probe. The label also can be an enzyme,
such as alkaline phosphatase or horseradish peroxidase, which when provided with an
appropriate substrate produces a product that can be detected. Alternatively, the label can be
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as an
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the
streptavidin is labeled as described above, thereby providing a detectable signal for the
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis.

As will be appreciated by those in the art, these assays can be direct
hybridization assays or can comprise "sandwich assays", which include the use of multiple
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730,
5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352,
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined
above, and then added to the biochip comprising a plurality of nucleic acid probes, under
conditions that allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention,
including high, moderate and low stringency conditions as outlined above. The assays are
generally run under stringency conditions which allow formation of the label probe
hybridization complex only in the presence of target. Stringency can be controlled by
altering a step parameter that is a thermodynamic variable, including, but not limited to,
temperature, formamide concentration, salt concentration, chaotropic salt concentration, pH,
organic solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain
steps at higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways.
Components of the reaction may be added simultaneously, or sequentially, in different orders,
with preferred embodiments outlined below. In addition, the reaction may include a variety
of other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.,
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.

Screens are performed to identify modulators of the breast cancer phenotype.
In one embodiment, screening is performed to identify modulators that can induce or
suppress a particular expression profile, thus preferably generating the associated phenotype.
In another embodiment, e.g., for diagnostic applications, having identified differentially
expressed genes important in a particular state, screens can be performed to identify
modulators that alter expression of individual genes. In another embodiment, screening is
performed to identify modulators that alter a biological function of the expression product of
a differentially expressed gene. Again, having identified the importance of a gene in a
particular state, screens are performed to identify agents that bind and/or modulate the
biological activity of the gene product.

In addition screens can be done for genes that are induced in response to a
candidate agent. After identifying a modulator based upon its ability to suppress a breast
cancer expression pattern leading to a normal expression pattern, or to modulate a single
breast cancer gene expression profile so as to mimic the expression of the gene from normal
tissue, a screen as described above can be performed to identify genes that are specifically
modulated in response to the agent. Comparing expression profiles between normal tissue
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences
can be identified and used by methods described herein for breast cancer genes or proteins.
In particular these sequences and the proteins they encode find use in marking or identifying
agent treated cells. In addition, antibodies can be raised against the agent induced proteins
and used to target novel therapeutics to the treated breast cancer tissue sample.
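
The identification of agent-specific sequences amounts to a set comparison across three expression profiles. The following is a minimal sketch only, assuming each profile has already been reduced to the set of genes scored as expressed; the gene identifiers and the function name are hypothetical.

```python
def agent_specific_genes(normal, cancer, treated):
    """Return genes expressed in agent treated tissue but in neither normal
    tissue nor untreated breast cancer tissue."""
    return set(treated) - set(normal) - set(cancer)


# Hypothetical sets of expressed genes for each tissue state.
normal_tissue = {"GENE_A", "GENE_B"}
cancer_tissue = {"GENE_B", "GENE_C"}
treated_tissue = {"GENE_B", "GENE_D"}

print(agent_specific_genes(normal_tissue, cancer_tissue, treated_tissue))
```

How a gene is scored as expressed (for example, a signal threshold on a biochip) is left open here and would need to be defined for the platform in use.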

Thus, in one embodiment, a test compound is administered to a population of
breast cancer cells that have an associated breast cancer expression profile. By
"administration" or "contacting" herein is meant that the candidate agent is added to the cells
in such a manner as to allow the agent to act upon the cell, whether by uptake and
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct
such as an adenoviral or retroviral construct, and added to the cell, such that expression of
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems
can also be used.

Once the test compound has been administered to the cells, the cells can be
washed if desired and are allowed to incubate under preferably physiological conditions for
some period of time. The cells are then harvested and a new gene expression profile is
generated, as outlined herein.

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g.,
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably
many, of the expression profile indicates that the agent has an effect on breast cancer activity.
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter
the phenotype can be devised. With this approach, the drug target need not be known and
need not be represented in the original expression screening platform, nor does the level of
transcript for the target protein need to change.
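
Detecting a change in the expression profile after treatment can be sketched as a per-gene comparison. The following is a minimal sketch only, assuming positive expression values for each gene before and after treatment and an arbitrary two-fold cutoff; the cutoff, the gene identifiers and the function name are assumptions of this example and are not values given in the description.

```python
import math


def changed_genes(before, after, min_fold=2.0):
    """Return genes whose expression changed by at least `min_fold` in either
    direction, mapped to the after/before ratio."""
    changed = {}
    for gene in before.keys() & after.keys():
        ratio = after[gene] / before[gene]
        if abs(math.log2(ratio)) >= math.log2(min_fold):
            changed[gene] = ratio
    return changed


# Hypothetical expression values before and after adding a test compound.
profile_before = {"GENE_A": 100.0, "GENE_B": 50.0, "GENE_C": 10.0}
profile_after = {"GENE_A": 25.0, "GENE_B": 55.0, "GENE_C": 40.0}

print(changed_genes(profile_before, profile_after))
```

An empty result would suggest no effect at the chosen cutoff, whereas one or more changed genes would flag the agent for follow-up.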

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of
either the expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic
acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a
preferred embodiment, the breast cancer amino acid sequence which is used to determine
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as
further described herein.

Preferably, the breast cancer modulatory protein is a fragment of
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the
C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in
coupling, i.e., to cysteine.

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

Measurements of breast cancer polypeptide activity, or of breast cancer or the
breast cancer phenotype can be performed using a variety of assays. For example, the effects
of the test compounds upon the function of the breast cancer polypeptides can be measured
by examining parameters described above. A suitable physiological change that affects
activity can be used to assess the influence of a test compound on the polypeptides of this
invention. When the functional consequences are determined using intact cells or animals,
one can also measure a variety of effects such as, in the case of breast cancer associated with
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in
cell metabolism such as cell growth or pH changes, and changes in intracellular second
messengers such as cGMP. In the assays of the invention, mammalian breast cancer
polypeptide is typically used, e.g., mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment,
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein
or mRNA. The level of protein is measured using immunoassays such as western blotting,
ELISA and the like with an antibody that selectively binds to the breast cancer polypeptide or
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are
preferred. The level of protein or mRNA is detected using directly or indirectly labeled
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or
enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using the breast cancer
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent
protein, CAT, or β-gal. The reporter construct is typically transfected into a cell. After
treatment with a potential modulator, the amount of reporter gene transcription, translation, or
activity is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of the
expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "breast cancer proteins."
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a
fragment shown herein.

In one embodiment, screening for modulators of expression of specific genes
is performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or
isolated gene product is used; that is, the gene products of one or more differentially
expressed nucleic acids are made. For example, antibodies are generated to the protein gene
products, and standard immunoassays are run to determine the amount of protein present.
Alternatively, cells comprising the breast cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining a breast
cancer protein and a candidate compound, and determining the binding of the compound to
the breast cancer protein. Preferred embodiments utilize the human breast cancer protein,
although other mammalian proteins may also be used, e.g. for the development of animal
models of human disease. In some embodiments, as outlined herein, variant or derivative
breast cancer proteins may be used.

Generally, in a preferred embodiment of the methods herein, the breast cancer
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be
made of any composition to which the compositions can be bound, that is readily separated
from soluble material, and that is otherwise compatible with the overall method of screening.
The surface of such supports may be solid or porous and of any convenient shape. Examples of
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose,
Teflon™, etc. Microtiter plates and arrays are especially convenient because a large number
of assays can be carried out simultaneously, using small amounts of reagents and samples.
The particular manner of binding of the composition is not crucial so long as it is compatible
with the reagents and overall methods of the invention, maintains the activity of the
composition and is nondiffusable. Preferred methods of binding include the use of antibodies
(which do not sterically block either the ligand binding site or activation sequence when the
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of
the protein or agent, excess unbound material is removed by washing. The sample receiving
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or
other innocuous protein or other moiety.

In a preferred embodiment, the breast cancer protein is bound to the support,
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the
support and the breast cancer protein is added. Novel binding agents include specific
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for
human cells. A wide variety of assays may be used for this purpose, including labeled in
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for
protein binding, functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the
breast cancer protein may be done in a number of ways. In a preferred embodiment, the
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a
fluorescent label), washing off excess reagent, and determining whether the label is present
on the solid support. Various blocking and washing steps may be utilized as appropriate.

In some embodiments, only one of the components is labeled, e.g., the
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than
one component can be labeled with different labels, e.g., ¹²⁵I for the proteins and a fluorophor
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by
competitive binding assay. The competitor is a binding moiety known to bind to the target
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand,
etc. Under certain circumstances, there may be competitive binding between the compound
and the binding moiety, with the binding moiety displacing the compound. In one
embodiment, the test compound is labeled. Either the compound, or the competitor, or both,
is added first to the protein for a time sufficient to allow binding, if present. Incubations may
be performed at a temperature which facilitates optimal activity, typically between 4 and
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally
removed or washed away. The second component is then added, and the presence or absence
of the labeled component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the test
compound. Displacement of the competitor is an indication that the test compound is binding
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the
activity of the breast cancer protein. In this embodiment, either component can be labeled.
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the
label on the support indicates displacement.

In an alternative embodiment, the test compound is added first, with
incubation and washing, followed by the competitor. The absence of binding by the
competitor may indicate that the test compound is bound to the breast cancer protein with a
higher affinity. Thus, if the test compound is labeled, the presence of the label on the
support, coupled with a lack of competitor binding, may indicate that the test compound is
capable of binding to the breast cancer protein.

In a preferred embodiment, the methods comprise differential screening to
identify agents that are capable of modulating the activity of the breast cancer proteins. In
this embodiment, the methods comprise combining a breast cancer protein and a competitor
in a first sample. A second sample comprises a test compound, a breast cancer protein, and a
competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples indicates the presence of an agent capable of
binding to the breast cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the breast cancer protein.
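
The comparison between the two samples can be sketched as a percent change in competitor binding. This is an illustrative sketch only; the function names, the replicate values and the 20% cut-off are assumptions chosen for the example and do not appear in the disclosure.

```python
from statistics import mean


def percent_change_in_competitor_binding(first_sample, second_sample):
    """Percent change in mean competitor binding, second sample vs. first.

    first_sample  -- replicate signals, protein + competitor only
    second_sample -- replicate signals, protein + competitor + test compound
    """
    baseline = mean(first_sample)
    if baseline == 0:
        raise ValueError("first sample shows no competitor binding")
    return 100.0 * (mean(second_sample) - baseline) / baseline


def indicates_binding(first_sample, second_sample, threshold_percent=20.0):
    """Flag a test compound when competitor binding differs by the threshold.

    The threshold is a hypothetical cut-off, not a value from the text.
    """
    change = percent_change_in_competitor_binding(first_sample, second_sample)
    return abs(change) >= threshold_percent


# Illustrative values only.
without_compound = [1000.0, 1040.0, 960.0]
with_compound = [410.0, 390.0, 400.0]
print(percent_change_in_competitor_binding(without_compound, with_compound))
print(indicates_binding(without_compound, with_compound))
```

In practice the cut-off would be set from the variability of the assay itself rather than fixed in advance.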

Alternatively, differential screening is used to identify drug candidates that
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins.
The structure of the breast cancer protein may be modeled, and used in rational drug design
to synthesize agents that interact with that site. Drug candidates that affect the activity of a
breast cancer protein are also identified by screening drugs for the ability to either enhance or
reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably
control and test samples are performed in at least triplicate to obtain statistically significant
results. Incubation of all samples is for a time sufficient for the binding of the agent to the
protein. Following incubation, samples are washed free of non-specifically bound material
and the amount of bound, generally labeled agent determined. For example, where a
radiolabel is employed, the samples may be counted in a scintillation counter to determine the
amount of bound compound.
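
As an illustration of how triplicate control and test counts might be compared, the sketch below computes a pooled-variance Student's t statistic using only the Python standard library. The choice of this particular test, the function name and the example counts are assumptions made for the example; the text above does not prescribe a statistical method.

```python
from math import sqrt
from statistics import mean, variance

# Two-tailed 5% critical value of Student's t for 4 degrees of freedom,
# i.e. two groups of three replicates (3 + 3 - 2 = 4).
T_CRIT_DF4 = 2.776


def pooled_t_statistic(group_a, group_b):
    """Student's t statistic for two independent groups, pooled variance."""
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two replicates")
    pooled_var = ((n_a - 1) * variance(group_a)
                  + (n_b - 1) * variance(group_b)) / (n_a + n_b - 2)
    if pooled_var == 0:
        raise ValueError("replicates show no variation; t is undefined")
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))


# Illustrative scintillation counts only (counts per minute).
control_cpm = [5200.0, 5000.0, 5100.0]
test_cpm = [2100.0, 2000.0, 1900.0]
t_value = pooled_t_statistic(control_cpm, test_cpm)
print(abs(t_value) > T_CRIT_DF4)
```

With other replicate numbers the degrees of freedom change, and the critical value must be looked up accordingly.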

A variety of other reagents may be included in the screening assays. These
include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used
to facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture
of components may be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of a breast cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising breast cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence of, or with previous
or subsequent exposure to, physiological signals, e.g., hormones, antibodies, peptides,
antigens, cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (i.e., cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate breast cancer agents are identified.
Compounds with pharmacological activity are able to enhance or interfere with the activity of
the breast cancer protein. Once identified, similar structures are evaluated to identify critical
structural features of the compound.

In one embodiment, a method of inhibiting breast cancer cell division is
provided. The method comprises administration of a breast cancer inhibitor. In another
embodiment, a method of inhibiting breast cancer is provided. The method comprises
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells
or individuals with breast cancer are provided. The method comprises administration of a
breast cancer inhibitor.

In one embodiment, a breast cancer inhibitor is an antibody as discussed
above. In another embodiment, the breast cancer inhibitor is an antisense molecule.

A variety of cell growth, proliferation, and metastasis assays are known to
those of skill in the art, as described below.

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate a normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3rd ed., 1994),
herein incorporated by reference. See also, the methods section of Garkavtsev et al. (1996),
supra, herein incorporated by reference.

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until
they touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (³H)-thymidine at
saturation density can be used to measure density limitation of growth. See Freshney (1994),
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a
normal phenotype and become contact inhibited and would grow to a lower density.

In this assay, labeling index with (³H)-thymidine at saturation density is a
preferred method of measuring density limitation of growth. Transformed host cells are
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation
density in non-limiting medium conditions. The percentage of cells labeling with
(³H)-thymidine is determined autoradiographically. See, Freshney (1994), supra.
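
The labeling index referred to above is the percentage of counted cells that carry label. A minimal sketch of that arithmetic follows; the function name and the example cell counts are hypothetical.

```python
def labeling_index(labeled_cells: int, total_cells: int) -> float:
    """Return the percentage of counted cells that are labeled."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


# Illustrative counts only: 36 labeled nuclei among 400 cells scored.
print(labeling_index(labeled_cells=36, total_cells=400))
```

A lower index at saturation density in transfected cells than in untransfected transformed cells would be consistent with restored density limitation of growth.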

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal
counterparts (see, e.g., Temin, J. Natl. Cancer Inst. 37:167-175 (1966); Eagle et al., J. Exp.
Med. 131:836-879 (1970); Freshney, supra). This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host
cells can be compared with that of control.

Tumor specific markers levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells (see, e.g.,
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth,
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal
counterparts (see, e.g., Folkman, Angiogenesis and Cancer, Sem. Cancer Biol. (1992)).

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. Also, see, Unkless et al., J. Biol. Chem. 249:4295-4305 (1974);
Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et al., Br. J. Cancer 42:305-312
(1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985);
Freshney, Anticancer Res. 5:111-130 (1985).

Invasiveness into Matrigel

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate breast
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Techniques described in Freshney (1994), supra, can be used. Briefly, the
level of invasion of host cells can be measured by using filters coated with Matrigel or some
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of
the filter, is rated as invasiveness, and rated histologically by number of cells and distance
moved, or by prelabeling the cells with ¹²⁵I and counting the radioactivity on the distal side of
the filter or bottom of the dish. See, e.g., Freshney (1984), supra.

Tumor growth in vivo 

Effects of breast cancer-associated sequences on cell growth can be tested in
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out
transgenic mice can be made by insertion of a marker gene or other heterologous gene into
the endogenous breast cancer gene site in the mouse genome via homologous recombination.
Such mice can also be made by substituting the endogenous breast cancer gene with a
mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene,
e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic
lesion (see, e.g., Capecchi et al., Science 244:1288 (1989)). Chimeric targeted mice can be
derived according to Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual,
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987).

Alternatively, various immune-suppressed or immune-deficient host animals
can be used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella et al., J.
Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated
mouse (see, e.g., Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al., Br. J. Cancer
41:52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10^ cells)
injected into isogenic hosts will produce invasive tumors in a high proportion of cases, while
normal cells of similar origin will not. In hosts which developed invasive tumors, cells
expressing a breast cancer-associated sequence are injected subcutaneously. After a suitable
length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two
largest dimensions) and compared to the control. Tumors that have statistically significant
reduction (using, e.g., Student's t test) are said to have inhibited growth.
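
For illustration, tumor size can be estimated from the two largest dimensions and compared with the control group. The sketch below uses the caliper approximation V = (length x width^2) / 2, which is a commonly used convention and an assumption here, not a formula given in the text. The function names and measurements are likewise hypothetical.

```python
from statistics import mean


def tumor_volume(dimension_1: float, dimension_2: float) -> float:
    """Approximate tumor volume from its two largest dimensions.

    Uses V = length * width**2 / 2, with length the larger measurement.
    This formula is an assumed convention, not taken from the disclosure.
    """
    if dimension_1 <= 0 or dimension_2 <= 0:
        raise ValueError("dimensions must be positive")
    length = max(dimension_1, dimension_2)
    width = min(dimension_1, dimension_2)
    return length * width ** 2 / 2.0


def percent_reduction(control_volumes, test_volumes):
    """Percent reduction of mean test volume relative to mean control volume."""
    control_mean = mean(control_volumes)
    if control_mean <= 0:
        raise ValueError("control volumes must have a positive mean")
    return 100.0 * (control_mean - mean(test_volumes)) / control_mean


# Illustrative caliper measurements only (millimetres).
print(tumor_volume(10.0, 8.0))
print(percent_reduction([320.0, 400.0], [160.0, 200.0]))
```

Whether an observed reduction is statistically significant would still need to be assessed separately, e.g., with the t test mentioned above.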

Polynucleotide modulators of breast cancer 

Antisense Polynucleotides

In certain embodiments, the activity of a breast cancer-associated protein is
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof.
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability
of the mRNA.

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring
subunits or their close homologs. Antisense polynucleotides may also have altered sugar
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other
sulfur containing species which are known for use in the art. Analogs are comprehended by
this invention so long as they function effectively to hybridize with the breast cancer protein
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as
phosphorothioates and alkylated derivatives is also well known to those of skill in the art.

Antisense molecules as used herein include antisense or sense
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by
binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a
single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense
molecule is for a breast cancer sequence in Tables 1-25, or for a ligand or activator thereof.
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988)) and
van der Krol et al. (BioTechniques 6:958 (1988)).
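
Deriving an antisense oligonucleotide from a target sequence amounts to taking the reverse complement of a window of about 14 to 30 nucleotides. The following is a minimal sketch under stated assumptions: the function name, the default window length and the example sequence are hypothetical, and the sequence shown is not one of the sequences of Tables 1-25.

```python
# Watson-Crick pairing for RNA bases.
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str, start: int, length: int = 20) -> str:
    """Return the antisense RNA (5'->3') for a window of the target mRNA.

    mrna   -- target sequence written 5'->3' using A, C, G, U
    start  -- zero-based index of the first nucleotide in the window
    length -- window length; restricted here to the 14-30 range in the text
    """
    if not 14 <= length <= 30:
        raise ValueError("length must be between 14 and 30 nucleotides")
    target = mrna.upper()[start:start + length]
    if start < 0 or len(target) < length:
        raise ValueError("window extends past the end of the sequence")
    if set(target) - set("ACGU"):
        raise ValueError("sequence contains characters other than A, C, G, U")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return target.translate(_RNA_COMPLEMENT)[::-1]


# Arbitrary illustrative sequence.
print(antisense_rna("AUGGCCAAGCUUGGAUCCGAAUUC", start=0, length=14))
```

A practical design would also weigh target accessibility, specificity and chemical modification, none of which this sketch attempts.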

Ribozymes

In addition to antisense polynucleotides, ribozymes can be used to target and
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes
have been described, including group I ribozymes, hammerhead ribozymes, hairpin
ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al., Adv. in
Pharmacology 25:289-317 (1994) for a general review of the properties of different
ribozymes).

The general features of hairpin ribozymes are described, e.g., in Hampel et al.,
Nucl. Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent
No. 5,254,678. Methods of preparing are well known to those of skill in the art (see,
WO 94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et
al., Human Gene Therapy 1:39-45 (1994); Leavitt et al., Proc. Natl. Acad. Sci. USA 92:699-703
(1995); Leavitt et al., Human Gene Therapy 5:1151-120 (1994); and Yamada et al.,
Virology 205:121-126 (1994)).

Polynucleotide modulators of breast cancer may be introduced into a cell
containing the target nucleotide sequence by formation of a conjugate with a ligand binding
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does
not substantially interfere with the ability of the ligand binding molecule to bind to its
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that the use of antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

Thus, in one embodiment, methods of modulating breast cancer in cells or
organisms are provided. In one embodiment, the methods comprise administering to a cell an
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous
breast cancer protein. Alternatively, the methods comprise administering to a cell or
organism a recombinant nucleic acid encoding a breast cancer protein. This may be
accomplished in any number of ways. In a preferred embodiment, e.g., when the breast
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by
overexpressing the endogenous breast cancer gene or administering a gene encoding the
breast cancer sequence, using known gene-therapy techniques. In a preferred
embodiment, the gene therapy techniques include the incorporation of the exogenous gene
using enhanced homologous recombination (EHR), e.g., as described in PCT/US93/03868,
hereby incorporated by reference in its entirety. Alternatively, e.g., when the breast cancer
sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is
decreased, e.g., by the administration of a breast cancer antisense nucleic acid.

In one embodiment, the breast cancer proteins of the present invention may be
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly,
the breast cancer proteins can be coupled, using standard technology, to affinity
chromatography columns. These columns may then be used to purify breast cancer
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer
antibodies may be coupled to standard affinity chromatography columns and used to purify
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined
above, since they will specifically bind to the breast cancer protein.

Methods of identifying variant breast cancer-associated sequences 

Without being bound by theory, expression of various breast cancer sequences
is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast
cancer genes may be determined. In one embodiment, the invention provides methods for
identifying cells containing variant breast cancer genes, e.g., determining all or part of the
sequence of at least one endogenous breast cancer gene in a cell. This may be accomplished
using any number of sequencing techniques. In a preferred embodiment, the invention
provides methods of identifying the breast cancer genotype of an individual, e.g., determining
all or part of the sequence of at least one breast cancer gene of the individual. This is
generally done in at least one tissue of the individual, and may include the evaluation of a
number of tissues or different samples of the same tissue. The method may include
comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene,
i.e., a wild-type gene.

The sequence of all or part of the breast cancer gene can then be compared to
the sequence of a known breast cancer gene to determine if any differences exist. This can be
done using any number of known homology programs, such as Bestfit, etc. In a preferred
embodiment, the presence of a difference in the sequence between the breast cancer gene of
the patient and the known breast cancer gene correlates with a disease state or a propensity
for a disease state, as outlined herein.
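
As a simplified illustration of comparing a sequenced gene to a known sequence, the sketch below lists position-by-position differences between two sequences that are assumed to be already aligned and of equal length. It is not an implementation of Bestfit or any other homology program; the function name and the example sequences are hypothetical.

```python
def sequence_differences(reference: str, sample: str):
    """List (position, reference_base, sample_base) for each mismatch.

    Positions are 1-based. Both sequences are assumed to be pre-aligned
    and of equal length; insertions and deletions are not handled.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (index + 1, ref_base, sample_base)
        for index, (ref_base, sample_base)
        in enumerate(zip(reference.upper(), sample.upper()))
        if ref_base != sample_base
    ]


# Arbitrary illustrative sequences differing at one position.
print(sequence_differences("ATGGCCAAG", "ATGGCTAAG"))
```

An empty list means no differences were found within the compared region. Real comparisons generally require an alignment step first so that insertions and deletions are handled.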

In a preferred embodiment, the breast cancer genes are used as probes to
determine the number of copies of the breast cancer gene in the genome.

In another preferred embodiment, the breast cancer genes are used as probes to
determine the chromosomal localization of the breast cancer genes. Information such as
chromosomal localization finds use in providing a diagnosis or prognosis in particular when
chromosomal abnormalities such as translocations, and the like are identified in the breast
cancer gene locus.

Administration of pharmaceutical and vaccine compositions

In one embodiment, a therapeutically effective dose of a breast cancer protein
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein
is meant a dose that produces effects for which it is administered. The exact dose will depend
on the purpose of the treatment, and will be ascertainable by one skilled in the art using
known techniques (e.g., Ansel et al., Pharmaceutical Dosage Forms and Drug Delivery;
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846,
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is
known in the art, adjustments for breast cancer degradation, systemic versus localized
delivery, and rate of new protease synthesis, as well as the age, body weight, general health,
sex, diet, time of administration, drug interaction and the severity of the condition may be
necessary, and will be ascertainable with routine experimentation by those skilled in the art.
U.S. Patent Application No. 09/687,576, which further discloses the use of compositions and
methods of diagnosis and treatment in breast cancer, is hereby expressly incorporated by
reference.

A "patient" for the purposes of the present invention includes both humans
and other animals, particularly mammals. Thus the methods are applicable to both human
therapy and veterinary applications. In the preferred embodiment the patient is a mammal,
preferably a primate, and in the most preferred embodiment the patient is human.

The administration of the breast cancer proteins and modulators thereof of the
present invention can be done in a variety of ways as discussed above, including, but not
limited to, orally, subcutaneously, intravenously, intranasally, transdermally,
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In
some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins
and modulators may be directly applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise a breast
cancer protein in a form suitable for administration to a patient. In the preferred embodiment,
the pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper,
manganese, aluminum salts and the like. Particularly preferred are the ammonium,
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines,
substituted amines including naturally occurring substituted amines, cyclic amines and basic
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine,
tripropylamine, and ethanolamine.

The pharmaceutical compositions may also include one or more of the
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring
agents; coloring agents; and polyethylene glycol.

The pharmaceutical compositions can be administered in a variety of unit
dosage forms depending upon the method of administration. For example, unit dosage forms
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies,
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally,
should be protected from digestion. This is typically accomplished either by complexing the
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a
protection barrier. Means of protecting agents from digestion are well known in the art.

The compositions for administration will commonly comprise a breast cancer
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These
solutions are sterile and generally free of undesirable matter. These compositions may be
sterilized by conventional, well known sterilization techniques. The compositions may
contain pharmaceutically acceptable auxiliary substances as required to approximate
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride,
sodium lactate and the like. The concentration of active agent in these formulations can vary
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and
the like in accordance with the particular mode of administration selected and the patient's
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Goodman & Gilman,
The Pharmacological Basis of Therapeutics (Hardman et al., eds., 1996)).

Thus, a typical pharmaceutical composition for intravenous administration
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per
patient per day may be used, particularly when the drug is administered to a secluded site and
not into the blood stream, such as into a body cavity or into a lumen of an organ.
Substantially higher dosages are possible in topical administration. Actual methods for
preparing parenterally administrable compositions will be known or apparent to those skilled
in the art, e.g., Remington's Pharmaceutical Science and Goodman and Gilman, The
Pharmacological Basis of Therapeutics, supra.

The compositions containing modulators of breast cancer proteins can be
administered for therapeutic or prophylactic treatments. In therapeutic applications,
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an
amount sufficient to cure or at least partially arrest the disease and its complications. An
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts
effective for this use will depend upon the severity of the disease and the general state of the
patient's health. Single or multiple administrations of the compositions may be administered
depending on the dosage and frequency as required and tolerated by the patient. In any event,
the composition should provide a sufficient quantity of the agents of this invention to
effectively treat the patient. An amount of modulator that is capable of preventing or slowing
the development of cancer in a mammal is referred to as a "prophylactically effective dose."
The particular dose required for a prophylactic treatment will depend upon the medical
condition and history of the mammal, the particular cancer being prevented, as well as other
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood
of developing cancer.

It will be appreciated that the present breast cancer protein-modulating
compounds can be administered alone or in combination with additional breast cancer
modulating compounds or with other therapeutic agents, e.g., other anti-cancer agents or
treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides
methods, reagents, vectors, and cells useful for expression of breast cancer-associated
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or
organism-based) recombinant expression systems.

The particular procedure used to introduce the nucleic acids into a host cell for
expression of a protein or nucleic acid is application specific. Many procedures for
introducing foreign nucleotide sequences into host cells may be used. These include the use
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection,
plasma vectors, viral vectors and any of the other well known methods for introducing cloned
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see,
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology
volume 152 (Berger), Ausubel et al., eds., Current Protocols (supplemented through 1999),
and Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989)).

In a preferred embodiment, breast cancer proteins and modulators are
administered as therapeutic agents, and can be formulated as outlined above. Similarly,
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory
sequences of the breast cancer coding regions) can be administered in a gene therapy
application. These breast cancer genes can include antisense applications, either as gene
therapy (i.e., for incorporation into the genome) or as antisense compositions, as will be
appreciated by those in the art.

Breast cancer polypeptides and polynucleotides can also be administered as
vaccine compositions to stimulate HTL, CTL and antibody responses. Such vaccine
compositions can include, e.g., lipidated peptides (see, e.g., Vitiello, A. et al., J. Clin. Invest.
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG")
microspheres (see, e.g., Eldridge, et al., Molec. Immunol. 28:287-294 (1991); Alonso et al.,
Vaccine 12:299-306 (1994); Jones et al., Vaccine 13:675-681 (1995)), peptide compositions
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al., Nature
344:873-875 (1990); Hu et al., Clin. Exp. Immunol. 113:235-243 (1998)), multiple antigen
peptide systems (MAPs) (see, e.g., Tam, Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413 (1988);
Tam, J. Immunol. Methods 196:17-32 (1996)), peptides formulated as multivalent peptides;
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery
vectors (Perkus, et al., In: Concepts in vaccine development (Kaufmann, ed., p. 379, 1996);
Chakrabarti, et al., Nature 320:535 (1986); Hu et al., Nature 320:537 (1986); Kieny, et al.,
AIDS Bio/Technology 4:790 (1986); Top et al., J. Infect. Dis. 124:148 (1971); Chanda et al.,
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al., J.
Immunol. Methods 192:25 (1996); Eldridge et al., Sem. Hematol. 30:16 (1993); Falo et al.,
Nature Med. 7:649 (1995)), adjuvants (Warren et al., Annu. Rev. Immunol. 4:369 (1986);
Gupta et al., Vaccine 11:293 (1993)), liposomes (Reddy et al., J. Immunol. 148:1585 (1992);
Rock, Immunol. Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et al.,
Science 259:1745 (1993); Robinson et al., Vaccine 11:957 (1993); Shiver et al., In: Concepts
in vaccine development (Kaufmann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev.
Immunol. 12:923 (1994) and Eldridge et al., Sem. Hematol. 30:16 (1993)). Toxin-targeted
delivery technologies, also known as receptor mediated targeting, such as those of Avant
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used.

Vaccine compositions often include adjuvants. Many adjuvants contain a
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide
or mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a
patient. This approach is described, for instance, in Wolff et al., Science 247:1465 (1990) as
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647;
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies
include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery,
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery
(see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the
invention can be expressed by viral or bacterial vectors. Examples of expression vectors
include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover et al., Nature 351:456-460 (1991). A wide variety of other vectors useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus vectors,
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like,
will be apparent to those skilled in the art from the description herein (see, e.g., Shata et al.,
Mol. Med. Today 6:66-71 (2000); Shedlock et al., J. Leukoc. Biol. 68:793-806 (2000); Hipp et
al., In Vivo 14:571-85 (2000)).

Methods for the use of genes as DNA vaccines are well known, and include
placing a breast cancer gene or portion of a breast cancer gene under the control of a
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient.
The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins,
but more preferably encodes portions of the breast cancer proteins including peptides derived
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For
example, breast cancer-associated genes or sequence encoding subfragments of a breast
cancer protein are introduced into expression vectors and tested for their immunogenicity in
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This
procedure provides for production of cytotoxic T cell responses against cells which present
antigen, including intracellular epitopes.

In a preferred embodiment, the DNA vaccines include a gene encoding an
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA
vaccine. Additional or alternative adjuvants are available.

In another preferred embodiment breast cancer genes find use in generating
animal models of breast cancer. When the breast cancer gene identified is repressed or
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed
to the breast cancer gene will also diminish or repress expression of the gene. Animal models
of breast cancer find use in screening for modulators of a breast cancer-associated sequence
or modulators of breast cancer. Similarly, transgenic animal technology including gene
knockout technology, e.g., as a result of homologous recombination with an appropriate gene
targeting vector, will result in the absence or increased expression of the breast cancer
protein. When desired, tissue-specific expression or knockout of the breast cancer protein
may be necessary.

It is also possible that the breast cancer protein is overexpressed in breast
cancer. As such, transgenic animals can be generated that overexpress the breast cancer
protein. Depending on the desired expression level, promoters of various strengths can be
employed to express the transgene. Also, the number of copies of the integrated transgene
can be determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods find use as animal models of breast cancer and are
additionally useful in screening for modulators to treat breast cancer.

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above,
kits are also provided by the invention. In the diagnostic and research applications such kits
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides,
ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecule
inhibitors of breast cancer-associated sequences, etc. A therapeutic product may include
sterile saline or another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing directions
(i.e., protocols) for the practice of the methods of this invention. While the instructional
materials typically comprise written or printed materials they are not limited to such. Any
medium capable of storing such instructions and communicating them to an end user is
contemplated by this invention. Such media include, but are not limited to electronic storage
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the
like. Such media may include addresses to internet sites that provide such instructional
materials.

The present invention also provides for kits for screening for modulators of
breast cancer-associated sequences. Such kits can be prepared from readily available
materials and reagents. For example, such kits can comprise one or more of the following
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and
instructions for testing breast cancer-associated activity. Optionally, the kit contains
biologically active breast cancer protein. A wide variety of kits and components can be
prepared according to the present invention, depending upon the intended user of the kit and
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality
of genes or products. The genes will be selected based on correlations with important
parameters in disease which may be identified in historical or outcome data.



EXAMPLES

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using TRIzol Reagent 

The sample weight is first estimated. The tissue samples are homogenized in
1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of
the generator/probe used depends upon the sample amount. A generator that is too large for
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g.
Tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization.

Tissues should be kept frozen until homogenized. The TRIzol is added
directly to the frozen tissue before homogenization. Following homogenization, the insoluble
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a
Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared
homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to
-70°C for at least one month, or the purification may be continued.

The next process is phase separation. The homogenized samples are incubated
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1 ml of TRIzol reagent is
added to the homogenization mixture. The tubes are securely capped and shaken vigorously
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp. for
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4°C.

The next process is RNA precipitation. The aqueous phase is transferred to a
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then
0.5 ml of isopropyl alcohol is added per 1 ml of TRIzol reagent used in the original
homogenization. Then, the tubes are securely capped and inverted to mix. The samples are
then incubated at room temp. for 10 minutes and centrifuged at 6500 rpm in a Sorvall for 20
min. at 4°C.

The RNA is then washed. The supernatant is poured off and the pellet washed
with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in
the initial homogenization. The tubes are capped securely and inverted several times to
loosen the pellet without vortexing. They are next centrifuged at <8000 rpm (<7500 x g) for 5
minutes at 4°C.
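
The per-millilitre ratios in the TRIzol steps above all scale with the starting tissue mass. The following is a minimal Python sketch of that arithmetic; the function name `trizol_volumes` and the 250 mg example sample are hypothetical, and only the ratios themselves are taken from the text.

```python
def trizol_volumes(tissue_mg: float) -> dict:
    """Scale reagent volumes (in ml) from tissue mass using the ratios in the text."""
    if tissue_mg <= 0:
        raise ValueError("tissue mass must be positive")
    trizol_ml = tissue_mg / 50.0  # 1 ml TRIzol per 50 mg of tissue
    return {
        "trizol_ml": trizol_ml,
        "chloroform_ml": 0.2 * trizol_ml,   # phase separation: 0.2 ml per ml TRIzol
        "isopropanol_ml": 0.5 * trizol_ml,  # RNA precipitation: 0.5 ml per ml TRIzol
        "ethanol_75_ml": 1.0 * trizol_ml,   # RNA wash: 1 ml per ml TRIzol
    }


# Hypothetical 250 mg tissue sample (illustrative value only).
volumes = trizol_volumes(250)
for name, ml in volumes.items():
    print(f"{name}: {ml:.2f}")
```

Because every downstream volume is keyed to the TRIzol volume, only the tissue mass needs to be entered.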

The RNA wash is decanted. The pellet is carefully transferred to an
Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it
in if necessary). Tube sizes for precipitating the RNA depend on the working volumes.
Larger tubes may take too long to dry. The pellet is dried. The RNA is then resuspended in an
appropriate volume (e.g., to 2-5 ug/ul) of DEPC H2O. The absorbance is then measured.

The poly A+ mRNA may next be purified from total RNA by other methods
such as Qiagen's RNeasy kit. The poly A+ mRNA is purified from total RNA by adding the
Oligotex suspension, which has been heated to 37°C and mixed prior to adding to the RNA.
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x
Binding Buffer at 65°C. The total RNA is mixed with DEPC-treated water, 2 x Binding
Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next
incubated for 3 minutes at 65°C and 10 minutes at room temperature.

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably
at a "soft setting." The supernatant is removed without disturbing the Oligotex pellet. A little bit
of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until
satisfactory binding and elution of poly A+ mRNA has been found.

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted
onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute.

Next, the spin column is transferred to a new collection tube and gently
resuspended in Wash Buffer OW2 and centrifuged as described herein.

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up
and down. The centrifugation is repeated as above and the elution repeated with fresh elution
buffer or first eluate to keep the elution volume low.

The absorbance is next read to determine the yield, using diluted Elution
Buffer as the blank.

Before proceeding with cDNA synthesis, the mRNA is precipitated, as
components leftover or in the Elution Buffer from the
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA.
0.4 vol. of 7.5 M NH4OAc + 2.5 vol. of cold 100% ethanol is added and the preparation
precipitated at -20°C for 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80%
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature.
The 80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried
without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at 1 ug/ul
concentration.

Alternatively, the RNA may be purified using other methods (e.g., Qiagen's RNeasy kit).
No more than 100 ug is added to the RNeasy column. The sample volume is
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an
RNeasy mini spin column for centrifugation (15 sec at >10,000 rpm). If yield is low, reapply
the flowthrough to the column and centrifuge again.

Then, transfer the column to a new 2 ml collection tube and add 500 ul Buffer
RPE and centrifuge for 15 sec at >10,000 rpm. The flowthrough is discarded. 500 ul Buffer
RPE is then added and the preparation is centrifuged for 15 sec at >10,000 rpm. The
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at
maximum speed. The column is transferred to a new 1.5-ml collection tube. 30-50 ul of
RNase-free water is applied directly onto the column membrane. The column is then centrifuged
for 1 min at >10,000 rpm and the elution step repeated.

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol.

First Strand cDNA Synthesis

The first strand can be made using Gibco's "Superscript Choice System
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+
mRNA. For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of
Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4 ul 5X 1st Strand
Buffer, 2 ul of 0.1 M DTT, and 1 ul of 10 mM dNTP mix. The preparation is then incubated at
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1
hour.
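
The first strand volumes above must add up to the stated 20 ul final volume. The sketch below tallies the components and reports how much volume remains; it assumes, which the text does not state, that any remaining volume is made up with water. The function name `first_strand_mix` and the example RNA volumes are hypothetical.

```python
FINAL_VOLUME_UL = 20.0   # stated final volume of the first strand mix
MAX_RNA_UL = 10.0        # stated upper limit for the RNA volume


def first_strand_mix(rna_ul: float, rna_type: str) -> dict:
    """Tally first strand components; the top-up with water is an assumption."""
    rt_ul = {"total": 2.0, "polyA": 1.0}[rna_type]  # Superscript RT volume per RNA type
    if not 0 < rna_ul <= MAX_RNA_UL:
        raise ValueError("RNA volume must be greater than 0 and at most 10 ul")
    mix = {
        "RNA": rna_ul,
        "T7-T24 oligo": 1.0,
        "5X 1st Strand Buffer": 4.0,
        "0.1 M DTT": 2.0,
        "10 mM dNTP mix": 1.0,
        "Superscript RT": rt_ul,
    }
    # Assumed: remaining volume is filled with water to reach 20 ul.
    mix["water (assumed top-up)"] = FINAL_VOLUME_UL - sum(mix.values())
    return mix


total_rna_mix = first_strand_mix(8.0, "total")   # hypothetical 8 ul of total RNA
polya_mix = first_strand_mix(10.0, "polyA")      # hypothetical 10 ul of polyA+ mRNA
```

With 10 ul of total RNA the listed components already sum to 20 ul, which is consistent with the 10 ul limit on RNA volume given in the text.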

Second Strand Synthesis

For the second strand synthesis, place 1st strand reactions on ice and add: 91
ul DEPC H2O; 30 ul 5X 2nd Strand Buffer; 3 ul 10 mM dNTP mix; 1 ul 10 U/ul E. coli DNA
Ligase; 4 ul 10 U/ul E. coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2
hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10 ul of 0.5 M
EDTA.
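
As a cross-check on the second strand recipe, the additions can be summed with the 20 ul first strand reaction to confirm that the 5X buffer ends up at 1X. This is an illustrative sketch only; the variable names are hypothetical and the volumes are those listed above.

```python
FIRST_STRAND_UL = 20.0  # volume carried over from first strand synthesis

# Additions listed in the text (ul).
ADDITIONS_UL = {
    "DEPC H2O": 91.0,
    "5X 2nd Strand Buffer": 30.0,
    "10 mM dNTP mix": 3.0,
    "E. coli DNA Ligase (10 U/ul)": 1.0,
    "E. coli DNA Polymerase (10 U/ul)": 4.0,
    "RNase H (2 U/ul)": 1.0,
}

reaction_ul = FIRST_STRAND_UL + sum(ADDITIONS_UL.values())

# Final strength of the 2nd strand buffer in the assembled reaction.
buffer_strength_x = 5.0 * ADDITIONS_UL["5X 2nd Strand Buffer"] / reaction_ul

# Volume after the T4 DNA Polymerase (2 ul) and 0.5 M EDTA (10 ul) additions.
stopped_ul = reaction_ul + 2.0 + 10.0

print(reaction_ul, buffer_strength_x, stopped_ul)
```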

Cleaning up cDNA 

The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24:1)
and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed.
The cDNA mix is then transferred to a PLG tube. An equal volume of
phenol:chloroform:isoamyl alcohol is then added, the preparation shaken vigorously (no
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is
transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH4OAc and 2.5X
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20
min, maximum speed. The supernatant is removed, and the pellet washed 2X with cold
80% ethanol. As much ethanol wash as possible should be removed before air drying the
pellet and resuspending it in 3 ul RNase-free water.

In vitro Transcription (IVT) and labeling with biotin

In vitro Transcription (IVT) and labeling with biotin is performed as follows:
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul
T7 10x ATP (75 mM) (Ambion); 2 ul T7 10x GTP (75 mM) (Ambion); 1.5 ul T7 10x CTP (75
mM) (Ambion); 1.5 ul T7 10x UTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-11-UTP
(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul 10x T7
transcription buffer (Ambion); and 2 ul 10x T7 enzyme mix (Ambion). The final volume is
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be further cleaned.
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol
handbook. The cRNA often needs to be ethanol precipitated, followed by resuspension in a volume
compatible with the fragmentation step.
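
The IVT labeling mix can be checked numerically: the listed volumes should sum to the stated 20 ul, and the ratio of biotinylated to unlabeled nucleotide follows from the stock concentrations. The sketch below is illustrative; the dictionary and helper names are hypothetical, and the amounts are computed from volume times stock concentration (ul x mM = nmol).

```python
# (volume in ul, stock concentration in mM or None) for each component in the text.
IVT_MIX = {
    "cDNA": (1.5, None),
    "T7 10x ATP": (2.0, 75.0),
    "T7 10x GTP": (2.0, 75.0),
    "T7 10x CTP": (1.5, 75.0),
    "T7 10x UTP": (1.5, 75.0),
    "Bio-11-UTP": (3.75, 10.0),
    "Bio-16-CTP": (3.75, 10.0),
    "10x T7 transcription buffer": (2.0, None),
    "10x T7 enzyme mix": (2.0, None),
}

total_ul = sum(vol for vol, _ in IVT_MIX.values())


def nmol(name: str) -> float:
    """Amount of a nucleotide in the mix: ul x mM gives nmol."""
    vol_ul, stock_mm = IVT_MIX[name]
    return vol_ul * stock_mm


# Fraction of each pyrimidine pool that carries biotin.
biotin_utp_fraction = nmol("Bio-11-UTP") / (nmol("Bio-11-UTP") + nmol("T7 10x UTP"))
biotin_ctp_fraction = nmol("Bio-16-CTP") / (nmol("Bio-16-CTP") + nmol("T7 10x CTP"))

print(total_ul, biotin_utp_fraction, biotin_ctp_fraction)
```

Under these numbers one quarter of the UTP pool and one quarter of the CTP pool is biotinylated.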

Fragmentation is performed as follows. 15 ug of labeled RNA is usually
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment
RNA by incubation at 94°C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea
of the transcript size range.
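
The fragmentation buffer is given as a 5 x stock used at 1 x, so the stock volume and working concentrations follow directly from the reaction volume. A minimal sketch, with a hypothetical function name and the 20 ul ceiling from the text enforced as a check:

```python
# Composition of the 5 x Fragmentation buffer stock (mM), from the text.
STOCK_5X_MM = {"Tris-acetate": 200.0, "KOAc": 500.0, "MgOAc": 150.0}
MAX_REACTION_UL = 20.0  # the text advises not exceeding 20 ul


def fragmentation_setup(reaction_ul: float):
    """Return (ul of 5 x stock, working concentrations in mM) for a 1 x reaction."""
    if not 0 < reaction_ul <= MAX_REACTION_UL:
        raise ValueError("reaction volume must be greater than 0 and at most 20 ul")
    stock_ul = reaction_ul / 5.0
    working_mm = {name: conc / 5.0 for name, conc in STOCK_5X_MM.items()}
    return stock_ul, working_mm


stock_ul, working_mm = fragmentation_setup(10.0)  # recommended 10 ul reaction
print(stock_ul, working_mm)
```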

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it
is recommended that an initial hybridization mix of 300 ul or more be made. The
hybridization mix is: fragmented labeled RNA (50 ng/ul final conc.); 50 pM 948-b control
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA;
0.5 mg/ml acetylated BSA; and brought to 300 ul with 1x MES hyb buffer.
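
The cRNA mass needed for a hybridization mix follows from the 50 ng/ul final concentration. The short sketch below (hypothetical function name) reproduces the 10 ug per 200 ul figure and shows that a 300 ul mix corresponds to the 15 ug of labeled RNA usually fragmented.

```python
CRNA_FINAL_NG_PER_UL = 50.0  # final concentration of fragmented labeled RNA


def crna_needed_ug(mix_volume_ul: float) -> float:
    """Mass of cRNA (ug) required for a hybridization mix of the given volume."""
    return CRNA_FINAL_NG_PER_UL * mix_volume_ul / 1000.0


per_chip_ug = crna_needed_ug(200.0)      # volume applied to one chip
initial_mix_ug = crna_needed_ug(300.0)   # recommended initial mix for multiple chips
print(per_chip_ug, initial_mix_ug)
```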

The hybridization reaction is conducted with non-biotinylated IVT (purified
by RNeasy columns) (see example 1 for steps from tissue to IVT). The following mixture is
prepared:

IVT antisense RNA; 4 ug:      __ ul
Random Hexamers (1 ug/ul):    4 ul
H2O:                          __ ul
                              -----
                              14 ul

Incubate the above 14 ul mixture at 70°C for 10 min.; then put on ice.
The reverse transcription procedure uses the following mixture:

0.1 M DTT:                    3 ul
50X dNTP mix:                 0.6 ul
H2O:                          2.4 ul
Cy3 or Cy5 dUTP (1 mM):       3 ul
SS RT II (BRL):               1 ul
                              -----
                              16 ul

The above solution is added to the hybridization reaction and incubated for 30 min. at 42°C.
Then, 1 ul SSII is added and incubated for another hour before being placed on ice.

The 50X dNTP mix contains 25 mM each of cold dATP, dCTP, and dGTP and 10 mM
of dTTP, and is made by adding 25 ul each of 100 mM dATP, dCTP, and dGTP and 10 ul of
100 mM dTTP to 15 ul H2O.
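
The 50X dNTP recipe can be verified by dilution arithmetic: each stock contributes its volume times its concentration to a 100 ul total. The sketch below is illustrative and uses hypothetical variable names.

```python
# (stock concentration in mM, volume in ul) for each nucleotide, from the text.
DNTP_STOCKS = {
    "dATP": (100.0, 25.0),
    "dCTP": (100.0, 25.0),
    "dGTP": (100.0, 25.0),
    "dTTP": (100.0, 10.0),
}
WATER_UL = 15.0

total_ul = WATER_UL + sum(vol for _, vol in DNTP_STOCKS.values())

# Final concentration of each nucleotide in the 50X mix (mM).
final_mm = {
    name: stock_mm * vol_ul / total_ul
    for name, (stock_mm, vol_ul) in DNTP_STOCKS.items()
}

print(total_ul, final_mm)
```

The reduced dTTP concentration leaves room for incorporation of the Cy3- or Cy5-labeled dUTP.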

RNA degradation is performed as follows. Add 86 ul H2O, 1.5 ul 1 M NaOH/
2 mM EDTA and incubate at 65°C, 10 min. For U-Con 30, 500 ul TE/sample spin at 7000 g
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con
recovered material in 500 ul buffer PB and proceed using Qiagen protocol. For DNAse
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min.
Incubate for 5 min at 95°C to denature the DNAse.

Sample preparation 

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC, 2.3 ul; Na pyro
phosphate, 7.5 ul; 10 mg/ml herring sperm DNA, 1 ul of 1/10 dilution; to 21.8 ul final vol. Dry
in speed vac. Resuspend in 15 ul H2O. Add 0.38 ul 10% SDS. Heat at 95°C for 2 min and slow
cool at room temp. for 20 min. Put on slide and hybridize overnight at 64°C. Washing after
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC + 0.75 ml 10% SDS in
250 ml H2O; 1X SSC: 5 min., 12.5 ml 20X SSC in 250 ml H2O; 0.2X SSC: 5 min., 2.5 ml 20X
SSC in 250 ml H2O. Dry slides and scan at appropriate PMT's and channels.
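
The wash recipes can be checked by dilution arithmetic. The sketch below assumes that 250 ml is the final volume of each wash (an interpretation, since the text says "in 250 ml H2O"); under that assumption the listed stock volumes give exactly the named strengths. All names are hypothetical.

```python
FINAL_ML = 250.0        # assumed final volume of each wash solution
SSC_STOCK_X = 20.0      # 20X SSC stock
SDS_STOCK_PCT = 10.0    # 10% SDS stock

# (label, ml of 20X SSC, ml of 10% SDS) for each wash, from the text.
WASHES = [
    ("3X SSC/0.03% SDS", 37.5, 0.75),
    ("1X SSC", 12.5, 0.0),
    ("0.2X SSC", 2.5, 0.0),
]


def wash_strength(ssc_ml: float, sds_ml: float):
    """Return (SSC strength in X, SDS in percent) after dilution to FINAL_ML."""
    return SSC_STOCK_X * ssc_ml / FINAL_ML, SDS_STOCK_PCT * sds_ml / FINAL_ML


strengths = {label: wash_strength(ssc_ml, sds_ml) for label, ssc_ml, sds_ml in WASHES}
for label, (ssc_x, sds_pct) in strengths.items():
    print(f"{label}: {ssc_x:.2f}X SSC, {sds_pct:.3f}% SDS")
```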
TABLE 1: Figure 1 from BRCA 001 US

Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor



Pkey ExAccn UnigenelD UnlgeneTitle R1 

100472 D90084 Hs.ia23 pyruvate dehydnsgenase (lipoamide) alpha 5 

100499 T51986 Hs.283108 hemoglobin, gamma G 10 

100545 M55405 gb;Homo sapiens mucin (MUC-3) mRNA, part 5 

100549 BE142019 Hs.222056 Homo sapiens cDNAFLJ1 1572 fis. done HE 10 

100613 X52078 Hs.101047 transcription factor 3 (E2A immunoglobul 5 

100635 BE259039 Hs.129953 Ewing sarcoma breaf^point region 1 5 

100645 X16841 Hs.167988 neural cell adhesk>n molecule 1 5 

100654 A03758 NM.000477*:Honno sapiens albumin (ALB), m 10 

100702 1^7065 gb:Human neurofibromatosis 2 (NF2) mRNA. 5 

100915 M60832 Hs.249239 collagen, type VIII. alpha 2 5 

100971 BE379727 Hs.83213 fatty add Wndlng protein 4. adipocyte 10 

101125 AJ250562 Hs.82749 transmembrane 4 superfamily member 2 5 

101166 M90424 Hs^099 lipocalin 1 (protein migrating faster th 5 

101184 NIVL001674 Hs.460 activating transcription factor 3 10 

101336 NI^006732 Hs.75678 FB J murine osteosarcoma viral oncogene h 10 

101367 X03350 Hs.4 alcohol dehydrogenase IB (class I), beta 10 

101447 M21305 gbiHuman alpha satellite and satellite 3 10 

101461 N9B569 Hs.76422 phospholipase A2, group HA (platelets. 10 

101511 M27826 Hs^67319 endogenous retroviral protease 10 

101634 AV650262 Hs.75765 GR02 oncogene 5 

101736 M74447 Hs.502 transporter2,ATP-blndIng cassette, sub 10 

102208 U22961 gb:Human mRNA done with similarity to L 10 

102297 NM.001S04 Hs.1 98252 G protein-coupled receptor 9 5 

102450 U48251 Hs.75671 protein kinase C binding protefnl 10 

102515 U89337 Hs.169886 tenascinXB 10 

102571 U60115 Hs^39069 four and a half LIM domains 1 5 

102800 AA313538 gb:EST185419 Colon carcinoma (HOC) cell 10 

102857 NM_006744 Hs.76461 retinoi-binding protein 4. interstitial 10 

102990 AA8"29286 Hs.332053 serum amyloid A1 10 

103434 X98085 Hs^4433 tenasdn R (nestrictin Janusln) 5 

103747 AA081995 gb:zn26d05.r1 Stratagene neuroepithelium 10 

103750 AA126129 gb:zm78c07.r1 Stratagene neuroepithelium 5 

103812 AA137107 Hs.326391 Homo sapiens, done MGC:16638, mRNA, com 10 ' 

103851 AA326216 Hs.8719 hypothetical protein M6C1 136 5 

104080 AB041036 Hs.57771 kallikrein 11 (KU<11;TLSP; PRSS20;hipp 5 

104093 R50727 Hs.336970 ESTs 10 

104106 AA422123 gb:zv26h12.r1 Soares_NhHIVIPiLS1 Homosapi 5 

104109 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 10 

104250 F06638 Hs.12440 Homo sapiens done 24734 mRNA sequence 10 

104340 AA426189 gb:zw11e09.r1 Soares_NhHMPu31 Homosapi 5 

104492 N73185 Hs.94285 EST 10 

104506 N91071 Hs.109650 ESTs 10 

104511 N99542 Hs^2 orosomucoidi 5 

104532 AI498763 Hs203013 hypothetical protein FIJI 2748 10 
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104536 R24024 Hs.158101 HomosaplenscDNAFLJ14$73fis.cloneNT 5 

104572 Y11312 Hs.132463 phosphoinositide-3-kinase, class 2, beta 5
104659 AW969769 Hs.105201 ESTs 5
104677 AA009764 Hs.190380 ESTs 10
104711 AA017245 Hs.32794 ESTs 10
104731 AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene 10
104764 AI039243 Hs.278585 ESTs 5
105005 AI298208 Hs.28805 ESTs 10
105038 AA130390 Hs.25549 hypothetical protein FLJ20898 10
105105 R61532 Hs.87016 hypothetical protein FLJ22938 5
105231 AW970043 Hs^38039 hypothetical protein FLJ11090 5
105239 AA221036 gb:zr03f12.r1 Stratagene NT2 neuronal pr 10
105921 AA421973 Hs.169119 ESTs, Weakly similar to T25731 hypotheti 5
105957 BE242857 Hs^7021 hypothetical protein FLJ11159 5
106052 N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti 10
106119 AL359624 Hs.11387 KIAA1453 protein 5
106181 AI803651 Hs.191608 ESTs 10
106194 AW976171 Hs.286194 hypothetical protein FLJ22233 5
106283 AI085846 Hs.25522 KIAA1808 protein 10
106379 AL042069 Hs.119021 DKFZP434N061 protein 10
106451 AW235928 Hs.313182 ESTs 10
106491 AA135688 Hs.10083 Homo sapiens, clone IMAGE:4139786, mRNA, 10
106700 AA906434 Hs.3776 zinc finger protein 216 5
106782 AW054886 Hs.25682 Homo sapiens mRNA for KIAA1863 protein, 10
106851 AI458623 gb:tk04g09.x1 NCI_CGAP_Lu24 Homo sapiens 5
106870 AI983730 Hs.26530 serum deprivation response (phosphatidyl 5
106892 AI347578 Hs.124015 hypothetical protein MGC2605 5
106954 AF128847 Hs.204038 indolethylamine N-methyltransferase 5
106991 AJ223811 Hs.30127 hypothetical protein 5
107103 AI446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY 5
107124 AB006532 Hs.31442 RecQ protein-like 4 10
107148 AI005036 Hs.334305 GS1999full 10
107214 AF127026 Hs.5394 myosin IA 10
107242 AB020672 Hs.175411 KIAA0865 protein 10
107331 AI905985 Hs.111805 ESTs 10
107351 U51704 Hs.323428 ESTs, Moderately similar to ALU8_HUMAN A 5
107423 W26652 Hs.6163 PTEN induced putative kinase 1 5
107447 W28516 Hs.19210 hypothetical protein MGC11308 10
107451 AL042425 Hs.283976 hypothetical protein PRO2389 10
107453 AI092790 Hs.334703 hypothetical protein FLJ14529 5
107459 W38002 Empirically selected from AFFX single pr 10
107683 N53167 Hs.47623 ESTs 10
107711 W96141 Hs.220687 ESTs 10
107754 AA017462 Hs.269244 ESTs 10
107757 BE621721 Hs.280792 hypothetical protein FLJ12387 similar to 10
107864 AA025060 Hs.61246 ESTs 10
107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha 10
107888 AA025836 Hs.191637 ESTs 5
107997 AL049176 Hs.82223 chordin-like 10
108056 AA043675 Hs.62633 ESTs 10
108081 AA093668 Hs.28578 muscleblind (Drosophila)-like 5
108113 AA012881 Hs.72531 hypothetical protein FLJ11838 10
108238 AA059473 Hs.66783 EST 10
108257 AA677927 Hs.144269 ESTs 5
108335 AA070500 gb:zm70h03.s1 Stratagene neuroepithelium 5
108351 AA071193 gb:zf79b12.s1 Soares_pineal_gland_N3HPG 10
108382 NM_006770 Hs.67726 macrophage receptor with collagenous str 5
108392 AA075124 gb:zm86a01.s1 Stratagene ovarian cancer 10
108441 AA079079 gb:zm97c09^1 Stratagene colon HT29 (937 10
108446 AA085383 gb:zn13g03^1 Stratagene hNT neuron (937 10
108497 AA074897 gb:zm85a05.r1 Stratagene ovarian cancer 10
108604 AA934589 Hs.49696 ESTs 5
108662 AF117646 Hs.156637 Cas-Br-M (murine) ecotropic retroviral tr 5
108706 AA121820 Hs.74569 KIAA0842 protein 10
108738 AA126583 Hs.158725 ESTs 10
108827 AI273692 Hs.110470 ESTs 10
109123 AI028376 Hs.73232 ESTs 10
109389 AA101325 Hs.86154 hypothetical protein FLJ12457 10

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 5
109919 R40804 Hs.129539 ESTs, Weakly similar to MCAOUMAN MITOC 10
110006 AI094674 Hs.30524 ring finger protein 24 10
110141 H46749 Hs.31540 ESTs 10
110354 W22165 Hs.22586 ESTs 5
110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-D-galactosamine:polyp 10
110448 H51276 Hs.13526 hypothetical protein FLJ12688 10
110455 H52576 gb:yt85e08.r1 Soares_pineal_gland_N3HPG 5
110540 H72639 Hs.167608 ESTs 5
110553 H60593 Hs.124990 ESTs 10
110976 AL044174 Hs.159526 patched (Drosophila) homolog 10
110987 AI753316 Hs.26034 ESTs 5
111158 N66616 Hs.138629 H.sapiens mRNA for subtelomeric repeats 5
111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 10
111187 AJ224864 Hs.9688 leukocyte membrane antigen 5
111307 AA641638 Hs^477 ESTs, Weakly similar to T46908 hypotheti 5
111400 R00144 Hs.189771 ESTs 10
111498 AI168511 gb:ow90h09.s1 Soares_fetal_liver_spleen_ 10
111651 R16733 Hs.20499 ESTs 10
111738 R26065 gb:yh39c03^1 Soares placenta Nb2HP Homo 5
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10
111995 R42333 Hs.302292 ESTs 10
112071 AL117490 Hs.47225 Ras-associated protein Rap1 10
112204 NM_006668 Hs.25121 cytochrome P450, subfamily 46 (cholester 10
112258 R51889 Hs.24990 ESTs 5
112490 R31094 Hs.24378 ESTs 10
112588 R77302 gb:yi75h06^1 Soares placenta Nb2HP Homo 10
112654 BE618629 Hs.268809 ESTs 5
112784 T98628 Hs.191290 ESTs 5
112817 AI057205 Hs.14584 ESTs 5
112885 AA581428 Hs.5021 EST 10
112913 T16837 Hs.4241 ESTs 5
113149 T51568 gb:yl]27e06^1 Stratagene fetal spleen (9 10
113174 T54659 Hs.301755 Homo sapiens cDNA FLJ11465 fis, clone HE 5
113203 AA743563 Hs.10305 ESTs 5
113299 AW207424 Hs.332594 ESTs 10
113367 N92359 Hs.14518 ESTs, Moderately similar to A48752 B-cel 10
113457 R16763 Hs^68679 ESTs 5
113563 AA913635 Hs.326413 Homo sapiens cDNA FLJ20812 fis, clone AD 10
113574 R06874 Hs.268628 ESTs, Moderately similar to ALULHUMAN A 5
113776 AI791905 Hs.95549 hypothetical protein 10
113790 AI244311 Hs.26912 ESTs 10
113807 W07586 Hs.6045 ESTs 3
113958 W86195 gb:zh54e05^1 Soares_fetal_liver_spleen_ 10
114211 Z39319 Hs^7347 EST 10
114254 AB018263 Hs.180338 tumor necrosis factor receptor superfami 5
114349 AA745978 Hs^8273 ESTs 5
114449 AA020736 gb:ze63b11.s1 Soares retina N2b4HR Homo 5
114484 AA034378 Hs.267319 endogenous retroviral protease 5
114576 AA065096 gb:zm50a02.s1 Stratagene fibroblast (937 5
114624 AA081507 gb:zn05b10.r1 Stratagene hNT neuron (937 5
114844 AA234826 Hs.87386 EST 5
114906 AA234462 Hs.87350 ESTs 5
115624 AK000725 Hs.50579 hypothetical protein FLJ20718 3
115666 AF173081 Hs.178215 Vertebrate LIN7 homolog 1, Tax interacti 5
115712 AB020649 Hs.74569 KIAA0842 protein 5
115889 AA398841 Hs.39850 hypothetical protein FLJ20517 10
115949 AI478427 Hs.43125 esophageal cancer related gene 4 protein 10
116107 AL133916 Hs.172572 hypothetical protein FLJ20093 10
116180 AA463902 Hs.13522 ESTs, Weakly similar to I36022 hypotheti 5
116267 AW968703 Hs.30085 hypothetical protein FLJ23186 5
116291 AW410377 Hs.41502 hypothetical protein FLJ21276 5
116527 AW194253 Hs.68607 ESTs 10
116659 BE314852 Hs.168694 Homo sapiens clone 23763 unknown mRNA, p 5
116708 F10528 Hs.70001 ESTs, Moderately similar to JC6169 nucle 5
117058 AW801806 gb:ll54JM0070-1 10400^^7 UM0070 Homo 5
117151 AI803656 Hs.42373 ESTs 5

117226 N20468 gb:yx39b10^1 Soares melanocyte 2NbHM Ho 10
117323 AI472863 Hs.43387 ESTs 5
117571 N34417 Hs.44584 ESTs 3
117624 N26627 Hs.82364 ESTs, Weakly similar to JC4124 pregnancy 5
117673 N40551 Hs.184043 Homo sapiens Ets-1 binding protein (El B) 10
117847 N49285 Hs.182391 ESTs 10
117877 AW263476 Hs.44268 myelin gene expression factor 2 10
117919 BE222341 Hs.279472 ESTs 5
118049 N53145 gb:yv55f09.s1 Soares fetal liver spleen 3
118413 AW955696 Hs.90960 ESTs 10
116613 AI078236 Hs.496d6 ESTs 5
118664 N70907 Hs.230619 EST 10
116658 AI122040 Hs.102961 Homo sapiens mRNA; cDNA DKFZp434G1972 (f 3
116902 AA993527 Hs^3907 hypothetical protein FLJ23403 5
119039 AI160570 Hs.252097 pregnancy specific beta-1-glycoprotein 6 3
119159 AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain 5
119216 AA514422 Hs.221849 ESTs 5
119317 AK002001 Hs.51305 v-maf musculoaponeurotic fibrosarcoma (a 10
119366 T77892 gb7d20i04^1 Soares fetal liver spleen 5
119378 T81824 Hs.90949 EST 5
119528 W38051 Empirically selected from AFFX single pr 10
119792 AL049798 Hs.80552 dermatopontin 3
119800 AF086332 Hs.58314 ESTs 10
119817 AF088061 Hs.159690 ESTs 6
119835 AF086429 Hs.58429 ESTs 5
119923 AW803308 Hs.62954 ferritin, heavy polypeptide 1 5
119961 U34249 Hs.337461 Human putative zinc finger protein (ZNFB 5
120379 AL042725 gb:DKFZp434B182^j1 434 (synonym: htes3) 10
120931 AW136934 Hs.97162 ESTs 5
121037 AA907743 Hs.142373 ESTs 5
121282 AA401695 Hs.97334 ESTs 5
121382 AA405763 Hs.111939 Homo sapiens cDNA FLJ20470 fis, clone KA 5
121764 AA421452 Hs.164851 ESTs, Weakly similar to KIAA0926 protein 5
122034 AK000229 Hs.98017 Homo sapiens cDNA FLJ20222 fis, clone CO 10
122441 AA447555 Hs.99116 EST 10
122756 AA458945 Hs.95898 ESTs 10
122771 AW135093 Hs.97282 ESTs, Highly similar to GIOO^HUMAN 1 10 K 5
123601 AA609122 Hs.112645 Homo sapiens mRNA; cDNA DKFZp434D2472 (f 5
123623 AI024595 Hs.97508 a disintegrin and metalloproteinase doma 5
123941 AA621529 gb:af47a02^1 Soares_total_fetus_Nb2HF8_ 10
124215 H62570 gb:yr44a01.r1 Soares fetal liver spleen 5
124276 H63465 gb:ys91a1U1 Soares retina N2b5HR Homo 5
124680 AK001527 Hs.163953 hypothetical protein FLJ10665 5
125099 NM_014312 Hs.112377 cortical thymocyte receptor (X laevis 10
125121 T98199 Hs.46403 hypothetical protein FLJ10847 10
125188 BE299567 Hs.271749 ESTs, Moderately similar to ALU8_HUMAN A 5
125284 NM_002666 Hs.103253 perilipin 10
125906 BE256206 Hs.17775 p75NTR-associated cell death executor, o 5
128484 AA485421 Hs.270503 ESTs, Weakly similar to ALU7_HUMAN ALU S 10
128511 NM_002250 Hs.10082 potassium intermediate/small conductance 10
128538 R44214 Hs.101189 ESTs 5
128606 C16161 Hs.283040 hypothetical protein PRO2543 5
128850 AA193106 Hs.180817 chromosome 11 open reading frame 23 10
128870 H39537 Hs.75309 eukaryotic translation elongation factor 10
128903 AW150717 Hs.296176 STAT induced STAT inhibitor 3 10
128931 N62689 Hs.107242 Homo sapiens cDNA FLJ12965 fis, clone NT 10
129001 AA443323 Hs.107812 BPOZ protein 5
129091 AA056483 Hs.301463 Human Chromosome 16 BAC clone CIT987SK-A 5
129101 NM_013403 Hs.108665 zinedin 10
129146 AL117472 Hs.108924 SH3-domain protein 5 (ponsin) 5
129213 An46494 Hs.109525 ESTs, Weakly similar to IRX2_HUMAN IROQU 3
129228 LI40714 Hs.239307 tyrosyl-tRNA synthetase 5
129265 AA530892 Hs.171695 dual specificity phosphatase 1 5
129285 BE617015 Hs.11006 ESTs, Moderately similar to T17372 plasm 10
129346 AF110141 Hs.288908 WAS protein family, member 2 10
129368 NM_003877 Hs.110776 STAT induced STAT inhibitor-2 5
129371 X06828 Hs.110802 von Willebrand factor 5

129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted 10
129440 W37944 Hs.4007 Sarcolemmal-associated protein 5
129441 BE061069 Hs.301943 KIAA0467 protein 10
129516 AF020038 Hs.11223 isocitrate dehydrogenase 1 (NADP+), solu 10
129554 BE222078 Hs.113069 ESTs 10
129684 BE622468 Hs.11924 ESTs, Weakly similar to I38022 hypotheti 5
129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypotheti 5
129778 AK001676 Hs.12457 hypothetical protein FLJ10814 10
129893 AK000956 Hs.13209 hypothetical protein FLJ10094 5
129928 AI33d993 Hs.134535 ESTs 5
129973 AJ251760 Hs.273385 guanine nucleotide binding protein (G pr 5
129977 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 5
130014 NM_001158 Hs.143102 amine oxidase, copper containing 2 (reti 5
130085 M62402 Hs.274313 insulin-like growth factor binding prote 10
130089 AA452006 Hs.333199 ESTs 5
130162 W80711 Hs.319946 Homo sapiens mRNA for KIAA1727 protein, 5
130243 D88435 Hs.153227 cyclin G associated kinase 10
130315 AI241084 Hs.154353 nonselective sodium potassium/proton exc 5
130339 AA435746 gb:zt79e03.s1 Soares_testis_NHT Homo sap 5
130400 V00517 Hs.283108 hemoglobin, gamma G 10
130436 NM_001928 Hs.155597 D component of complement (adipsin) 10
130478 X72308 Hs.251526 small inducible cytokine A7 (monocyte ch 5
130480 BE222978 Hs.15760 MYG1 protein 10
130494 AW390834 Hs.75874 pregnancy-associated plasma protein A 5
130563 BE270472 Hs.279900 HSPC015 protein 10
130589 AL110226 Hs.16441 DKFZP434H204 protein 10
130606 AI652143 Ks^88382 hypothetical protein FLJ13111 5
130634 AI769067 Hs.127824 ESTs, Weakly similar to T28770 hypotheti 3
130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA 10
130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10
130716 AA232075 Hs.18259 XPA binding protein 1; putative ATP(GTP) 5
130718 AF263462 Hs.18376 KIAA1319 protein 10
130722 N41322 Hs.18441 ESTs 5
130798 M81349 Hs.1955 serum amyloid A4, constitutive 10
130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10
131184 AB040935 Hs.23954 cerebral cell adhesion molecule 10
131261 AA360419 Hs.171776 inositol(myo)-1(or 4)-monophosphatase 1 10
131262 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10
131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 10
131340 AK000393 Hs.25817 BTB (POZ) domain containing 2 5
131341 AF110908 Hs.297660 TNF receptor-associated factor 3 5
131406 H83294 Hs.284122 Wnt inhibitory factor-1 5
131489 BE394648 Hs.27414 hypothetical protein 5
131543 AW966881 Hs.41639 programmed cell death 2 10
131692 BE559681 Hs.30736 KIAA0124 protein 5
131753 AA829286 Hs.332053 serum amyloid A1 10
131756 AA443966 Hs.315.95 ESTs 10
131785 H69342 Hs.26320 TRABID protein 10
131815 AA021258 Hs.32753 ESTs 5
131819 BE244961 Hs.173103 FE65-LIKE2 5
131828 AJ000263 Hs.278658 keratin, hair, basic, 6 (monilethrix) 10
131888 AW294659 Hs.34054 Homo sapiens cDNA: FLJ22488 fis, clone H 5
131927 AJ003112 Hs.34780 doublecortex; lissencephaly, X-linked (d 5
131949 AK000010 Hs.258798 hypothetical protein FLJ20003 10
132115 H81604 Hs.178471 KIAA0798 gene product 5
132177 X80818 Hs.178078 glutamate receptor, metabotropic 4 5
132296 AA467752 Hs.195161 ESTs 5
132426 AW118072 Hs.89981 diacylglycerol kinase, zeta (104kD) 10
132477 S68874 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 5
132675 AI291496 Hs.5476 Homo sapiens, clone IMAGE:3530123, mRNA, 10
132796 NM_006283 Hs.173159 transforming, acidic coiled-coil contain 10
132898 W28548 Hs.224829 ESTs 10
132905 NM_004235 Hs.7934 Kruppel-like factor 4 (gut) 10
132953 BE175645 Hs.321264 LBP protein 32 5
133116 BE563966 Hs.6529 ESTs, Weakly similar to I78885 serine/th 5
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10
133139 AF052138 Hs.6580 Homo sapiens cDNA: FLJ23227 fis, clone C 5

133163 AA668224 Hs.6634 Homo sapiens cDNA: FLJ22547 fis, clone H 5
133258 AW956781 Hs^93937 ESTs, Weakly similar to FXD2_HUMAN FORKH 5
133272 NM_002776 Hs.69423 kallikrein 10 (KLK10) (PRSSL1) (nes1) 5
133379 AA207059 gb:zq80h09.s1 Stratagene hNT neuron (937 5
133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5
133552 H21497 Hs.7471 BBP-like protein 1 5
133702 L02321 Hs.75652 glutathione S-transferase M5 5
133719 H26904 Hs.75736 apolipoprotein D 5
133731 N71725 Hs.272572 hemoglobin, alpha 2 10
133789 T85626 Hs.76239 hypothetical protein FLJ20608 5
134007 AF072441 Hs.7840 calcineurin binding protein 1 10
134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 10
134111 AI372566 Hs.8022 TU3A protein 10
134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr 10
134177 BE243319 Hs.79672 KIAA0652 gene product 5
134308 AW905827 Hs.81454 ketohexokinase (fructokinase) 10
134361 BE549343 Hs.82208 acyl-Coenzyme A dehydrogenase, very long 5
134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5
134449 L34155 Hs.83450 laminin, alpha 3 (nicein (150kD), kalini 5
134467 AI190413 Hs.8373 ESTs 10
134496 M64936 gb:Homo sapiens retinoic acid-inducible 10
134510 NM_002757 Hs.250870 mitogen-activated protein kinase kinase 10
134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 10
134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 5
134591 U73394 Hs.166085 killer cell immunoglobulin-like receptor 5
134678 AL008583 Hs.182595 dynein, axonemal, light polypeptide 4 5
134728 D10216 Hs.89394 POU domain, class 1, transcription facto 5
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10
134786 729618 Hs.89640 TEK tyrosine kinase, endothelial (venous 10
134912 T87521 Hs.261457 ESTs 5
134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami 10
134969 H22570 Hs.172572 hypothetical protein FLJ20093 5
135001 AA302517 Hs.92732 KIAA1444 protein 5
135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10
135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 10
135197 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 5
135219 AB002361 Hs.96633 KIAA0363 protein 5
135250 U83171 Hs.97203 small inducible cytokine subfamily A (Cy 5
135304 AA416829 Hs.191597 ESTs 5
135337 AA905406 Hs.9905 ESTs, Weakly similar to unnamed protein 3
135417 X55019 Hs.99975 cholinergic receptor, nicotinic, delta p 10
101387 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 5
128870 H39537 Hs.75309 eukaryotic translation elongation factor 5
129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted 5
130085 M62402 Hs.274313 insulin-like growth factor binding prote 5
130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 3
133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5
133731 N71725 Hs.272572 hemoglobin, alpha 2 5
134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5
135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10
135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5
322580 AK001852 Hs.274151 ligatin 5
408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 10
418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 5
427458 BE208364 Hs.29283 ESTs, Weakly similar to LKHU proteoglycan link 5
446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 10
449826 U85642 Hs.138506 ESTs 5
RC.H15814.S Human apM1 mRNA for GS3109 (novel adipose specific collagen 10
Ya024w/RIP1 EST-YEL024W/RIP1 3

TABLE 1A

Table 1A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 1.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.
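
The row layout described above (probeset key, gene cluster number, then the accession numbers making up the cluster) can be sketched as a small lookup structure. This is only an illustrative sketch, not part of the original disclosure: the names `ClusterRow` and `parse_cluster_row` are hypothetical, and it assumes each row has been joined onto a single whitespace-separated line. The two sample rows are taken from the table that follows.

```python
from typing import NamedTuple


class ClusterRow(NamedTuple):
    """One Table 1A row: probeset key, cluster number, member accessions."""
    pkey: str
    cat_number: str
    accessions: list[str]


def parse_cluster_row(line: str) -> ClusterRow:
    # Assumption: fields are whitespace-separated and continuation lines
    # have already been joined into one line per probeset.
    pkey, cat_number, *accessions = line.split()
    return ClusterRow(pkey, cat_number, accessions)


# Sample rows copied from Table 1A.
rows = [
    "124215 1597154_1 H62570 H59063",
    "113958 genbank_W86195 W86195",
]

# Index the rows by Pkey so a probeset can be mapped to its cluster.
table = {row.pkey: row for row in map(parse_cluster_row, rows)}
```

Looking up `table["124215"]` then yields the cluster number and the list of Genbank accessions for that probeset.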



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT Number Accessions



108446 112224_1 AA085383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978 AA075372 AA128503

108497 IIOO79I2 AA074897 AA113914 AA064871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA1071453 AA078803
AA148628 AA122204 AA074159 AA126185 AA079117 AA127089 AA070912 AA079280 AA131372 AA078833 AA071087
AA076131 AA071047 AA079401 AA083070 AA102076 AA115163 AA074198 AA134725 AA113889 AA121103 AA075041
AA065148 AA071310 AA101144 AA079659 AA078931 AA079209 AA070928 AA058994 AA069817 AA076187 AA069053
AA131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070053 AA126283 AA126078 AA075895
AA079208 AA074583 AA071086 AA079623 AA070627 AA078802 AA076622 AA065051 AA079143 AA071110 AA079434
AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA085118 AA079450 AA085044 AA120938
AA079200 AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027
AA115929

124215 1597154_1 H62570 H59063

117058 1219924_1 AW801806 H90434 BE086530

110455 46874_1 H52576 AF085971 H52172

111168 38585_1 AI798376 346400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW658362 AA232351 AA218567
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BH)81333 BE073424 AW817662 AW817705 AW817703
AW817659 BE081531 H59570

111498 411008_1 AI168511 AI022712 AA700366 R07371 R07324

104340 46289_10 AA426189 F15201

103747 117944_1 AA081995 AA101099

134496 46501_1 M64936 AI025512 AI362987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311
AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514

103750 118365_1 AA126129 AA126033 AA082561

105239 34624_1 AA221036 R87170 BE537066 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627
AW869639 BE061833 BE000620 AW961170 AW847519 AA30d542 AW821833 AW945668 C04699 AA205504 AA377241
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904
C16859

120379 34624_3 AL042725 BE063316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 AA223286 AA071122
AA227849 AA216700 AI696002 AA101867 AA099426 AA135997 ALa41698 T02815 T51824 AA207189 T59230 T51868
AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856396 AW861859 AA053045

114624 111686_1 AA081507 AA070071 AA070840 AA084362

106851 322947_1 AI458623 AA639708 AA485409 R22065 AA485570

108392 113549_1 AA075124 AA075208

100545 22955_11 M55405 AW752552

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495
X51363 X51364 X51365

100702 tigr_HT3413 L27065

102208 6735_9 U22961 AA203623AA503337 Al 174733 AM 92802 C06092AA035357 AI190619A1199244 AI828450 AA602296 AI378195 
AI20gi70 AI166653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 At018388 R66584 AA027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI363957 AA1d6873 AI333101
N70806 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 A1064826 AI061615 AI133473 AI174852 AI133404 
AM 33272 V00494 M12523 M12523 AI207526AI133120AI064802AI174993 AH 14729 A1061645 AI064716 A1064959 H77388 
T85706 AF075298 AM 10799 D17107 NM_000477 AF190168 R50724 A1248416 AI207432 AI133684 AI133345 A1174710 
A1133290AI133304AI174948A1207484AM10717 AF074624 All 14515 AF06351 6 All 10642 All 14559 AI114498 AI114759 
A1207568 AI0649^ AM 74753 All 14666 R69184 R0001 1 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189T74936 AI207490 AI132925 AI064701 
AI17474B All 14563 AI133104 AI132999 A1133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 
AI444620T52290 D16931 T40012 T48403 T58926 T69195 Ai133061 T50850 AI400677 AI091136 AA334608 T5741 1 Z20979 
N56507 T87485 A1133622 AA34337D T40075 T69671 T53849 T74820 AF075316 All 10818 T40121 T57381 AI1144S8 
AA332728 T51362 All 14589 R06691 All 10629 AF063503 AI140543 AA334661 AA332720 AA343262 773513 T86549 
All 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 739898 A1174786 
AI132926 R09237 AI064838 A1133660 760398 T88753 755930 792126 AM44602 760996 AI114792 H93911 AI1331M 
R10779AI065020 790925 750889 D17029 AI133703 AA333805 A1133040 AI133017 AI064857 All 1 0730 AF074 637 AI207567 
H71080 773217 AA343950AI174743AA334224AA334281 R06692 764739 7401 63 T60628 T81 661 773179 R01842 
AA501 730 739931 739662 740136 AA334904 771 425 H77784 R00874AI065049 78451 2 T559 18 A1207595 739951 
AA005016 T60361 769176 773356 758795 T61233 739955 760612 AI1 14676 A1064778 AA035710 W52763 AI1 14786 
783564 AA341859 781684755769 All 14710 751776 AA343213 At114714 T58102 AI110809 R28984 AI174854 AA305675 
AA343592 753836 746869 764721 755508 W05241 754019 757945 T60513 748364 AF075308 W86731 782851 748269 
H54053 T7321 1 AI114590 748317 755965 774857 R84226 T56552 752231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147AI478102AI207662AI192792 AI768421 AI0&4737 AW051713 AA936693 An33117AI766232 A1913646 
783962 A1065112 AI207689 AI174684 A1207702 781475 All 33325 AI032512 AA701169 A1936354 All 14720 AI433289 
AA046980 AI823182 A»1 14536 AA860651 AW242644 R07469 AW300438 AI13341 6 AW271670 A1991363 778943 AI823481 
AA845518 AA719124 AA883454 768850 T69115 AI935509 AI150977 T62890 T71374 768294 AI174774 76741 1 768318 
A1064689 756624 T69010 768982 768302 AI332829 772908 AI064819 A1205880 762895 T69430 795111 AA025050 773330 
W52657 T71984 T69118 W92684 AI114860 762093 761797 AI522333 T73322 H92981 75601 8 761811 757232 AI338158 
T61821 769457 762900 762912 T72917 746885 AI702448 T57212 757203 R94581 77131 1 761819 789358 767708 770918 
T59166AI187111 764308 T62071 769427 AI114750 T60430 R09734 769033769141 769453 T67908R16809 769394 
AI207729 755839 790273 T73339 AW194909 775486 T71850 771 305 T71287 T53877 773452 768852 N75290 AI312890 
767751 A1174983 T51679 T54851 H69880 N73734 AA443453 773466 H69672 N53869 768447 D1 1809 D1241 2 764300 
T28321 T55864 

123941 genbank_AA621529 AA621529
118049 genbank_N53145 N53145
102800 14782^20 AA313538 U88895 U88902
104106 AA422123_LatAA422123j
111738 genbank_R26065 R26065
113149 genbank_T51588 T51588
113958 genbank_W86195 W86195
108335 genbank_AA070500 AA070500
108351 genbank_AA071193 AA071193
108441 genbank_AA079079 AA079079
124276 genbank_H83465 H83465
101447 entrez_M21305 M21305
117226 genbank_N20468 N20468
133379 genbank_AA207059 AA207059;^7241
119366 genbank_T77892 T77892
119528 NOT_FOUND_entrez_W38051 W38051
112588 genbank_R77302 R77302
114449 genbank_AA020736 AA020736
114576 genbank_AA065096 AA065096
107459 W38002.s.at W38002.S
130339 genbank_AA435746 AA435746

TABLE 2: Figure 2 from BRCA 001 US



Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor
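
As a reading aid, the column layout defined above can be sketched as a small parser that splits a row into its five fields and then selects entries by their R1 value. This is a hedged illustration only: `GeneRow` and `parse_gene_row` are hypothetical names, the rule that a UnigeneID starts with "Hs." is an assumption used to handle rows that lack one, and the cutoff of 10 is chosen purely for the example. The sample rows are entries of Table 2 written one per line.

```python
from typing import NamedTuple, Optional


class GeneRow(NamedTuple):
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # None when the row has no UnigeneID
    title: str
    r1: float  # ratio of normal breast tissue to tumor


def parse_gene_row(line: str) -> GeneRow:
    tokens = line.split()
    pkey, ex_accn = tokens[0], tokens[1]
    r1 = float(tokens[-1])  # the ratio is the last field on the row
    middle = tokens[2:-1]
    # Assumption: a UnigeneID, when present, starts with "Hs.".
    if middle and middle[0].startswith("Hs."):
        unigene_id, title_tokens = middle[0], middle[1:]
    else:
        unigene_id, title_tokens = None, middle
    return GeneRow(pkey, ex_accn, unigene_id, " ".join(title_tokens), r1)


rows = [
    "104093 R50727 Hs.336970 ESTs 10",
    "134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 5",
    "101447 M21305 gb:Human alpha satellite and satellite 3 10",
]
parsed = [parse_gene_row(r) for r in rows]

# Illustrative cutoff: keep probesets whose normal-to-tumor ratio is at least 10.
strong = [g.pkey for g in parsed if g.r1 >= 10]
```

With these three rows, `strong` holds the two probesets whose ratio is 10, and the row without a UnigeneID keeps its full "gb:" title.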



Pkey ExAccn UnigeneID Unigene Title R1

100499 T51986 Hs.283108 hemoglobin, gamma G 10
100549 BE142019 HS22205o Homo sapiens cDNA FLJ11572 fis, clone nc 10
100654 A03758 NM_000477:Homo sapiens albumin (ALB), m 10
100971 Bcj79727 nS.o3ZlJ fatty acid binding protein 4, adipocyte 10
101184 NM_001674 Hs.460 activating transcription factor 3 10
101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10
101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10
101447 M21305 gb:Human alpha satellite and satellite 3 10
101461 N98569 Hs.76422 phospholipase A2, group IIA (platelets, 10
101511 M27826 Hs.267319 endogenous retroviral protease 10
101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub-famil 10
102208 U22961 gb:Human mRNA clone with similarity to L 10
102450 U48251 Hs.75871 protein kinase C binding protein 1 10
102800 AA313538 gb:EST185419 Colon carcinoma (HCC) cell 10
102857 NM_006744 Hs.76461 retinol-binding protein 4, interstitial 10
102990 AA829286 Hs.332053 serum amyloid A1 10
103747 AA081995 gb:zn26d05.r1 Stratagene neuroepithelium 10
103812 AA137107 Hs.326391 Homo sapiens, clone MGC:16638, mRNA, com 10
104093 R50727 Hs.336970 ESTs 10
104109 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 10
104250 F06638 Hs.12440 Homo sapiens clone 24734 mRNA sequence 10
104492 N73185 Hs.94285 EST 10
104506 N91071 Hs.109650 ESTs 10
104532 AI498763 Hs.203013 hypothetical protein FLJ12748 10
104677 AA009764 Hs.190380 ESTs 10
104711 AA017245 Hs.32794 ESTs 10
104731 AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene 10
105005 AI298208 Hs.28805 ESTs 10
105036 AA130390 Hs.25549 hypothetical protein FLJ20898 10
105239 AA221036 gb:zr03f12.r1 Stratagene NT2 neuronal pr 10
106052 N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti 10
106181 AI803651 Hs.191608 ESTs 10
106283 AI085846 Hs.25522 KIAA1808 protein 10
106379 AL042069 Hs.119021 DKFZP434N061 protein 10
106451 AW235928 Hs.313162 ESTs 10
106491 AA135688 Hs.10083 Homo sapiens, clone IMAGE:4139786, mRNA, 10
106782 AW054886 Hs.25682 Homo sapiens mRNA for KIAA1863 protein, 10
107124 AB006532 Hs.31442 RecQ protein-like 4 10
107148 AI005036 Hs.334305 GS1999full 10
107214 AF127026 Hs.5394 myosin IA 10
107242 AB020672 Hs.175411 KIAA0865 protein 10
107331 AI905985 Hs.111805 ESTs 10
107447 W28516 Hs.19210 hypothetical protein MGC11308 10
107451 AL042425 Hs.283976 hypothetical protein PRO2389 10
107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha 10
108351 AA071193 gb:zf79b12.s1 Soares_pineal_gland_N3HPG 10
109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 10
110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-D-galactosamine:polyp 10
110976 AL044174 Hs.158526 patched (Drosophila) homolog 5
111168 


AI798376 




gb:tr34b07J{1 NCLCGAP_Ov23 Homo sapiens 


10 


111651 


R16733 


Hs.20499 


ESTs 


10 


111803 


AA593731 


Hs.325823 


ESTs, Moderately similar to ALU5_HUMAN A 


10 


114484 


AA034378 


Hs.267319 


endogenous retroviral protease 


10 


125284 


NIVL002666 


Hs.103253 


perilipln 


10 


128850 


AA193106 


Hs.180817 


chromosome 1 1 open reading frame 23 


5 


128903 


AW150717 


HS296176 


STAT induced STAT inhibitor 3 


10 


129346 


AF1 10141 


Hs.288908 


WAS protein family, member 2 


10 


129381 


AW245805 


Hs.1 10903 


daudin 5 (transmembrane protein deleted 


10 


129516 


AF020038 


Hs.11223 


Isocitrate dehydrogenase 1 (NADP+), solu 


10 


129554 


BE222078 


Hs.1 13069 


ESTs 


10 


130085 


M62402 


Hs.274313 


insulln-nke growth factor binding prote 


10 


130243 


D88435 


Hs.153227 


cycUn G associated kinase 


10 


130400 


V00517 


H&283108 


hemoglobin, gamma G 


10 


130436 


NK.00ig28 


Hs.155597 


D component of complement (adipsin) 


10 


130563 


BE270472 


HS279900 


HSPC015 protein 


10 


130589 


AL110226 


Hs.16441 


DKFZP434H204 protein 


10 


130883 


AA993269 


Hs.17872 


Homo sapiens, clone IMAGE:3875012, mRNA 


10 


130689 


NM006691 


Hs.17917 


extracellular linl^ domalnKXjntalning 1 


10 


130689 


AA046747 


Hs.17917 


extracellular link domain-containing 1 


10 


130718 


N70196 


Hs.18376 


KIAA1319 protein 


10 


130798 


MB1349 


Hs.1955 


serum amyloid A4, constitutive 


10 


130840 


BE048821 


Hs.20144 


small inducible cytokine subfamBy A (Cy 


10 


131184 


AB040935 


Hs23954 


cerebral ceil adhesion molecule 


10 


131282 


X03350 


Hs.4 


alcohol dehydrogenase IB (dass 1), beta 


10 


131328 


AW939251 


Hs.25647 


v-fos FBJ murine osteosarcoma viral onco 


10 


131543 


AW966881 


Hs.41639 


programmed cell death 2 


10 


131753 


AA829286 


Hs.332053 


serum amyloid A1 


10 


131785 


H69342 


Hs^320 


TRABID protein 


10 


131828 


AJ000263 


Hs.278658 


keratin, hair, basic, 6 (monilethrix) 


10 


132426 


AW118072 


Hs.89981 


diacylglycerol kinase, zeta (104kD) 


10 


132675 


AI291496 


Hs.5476 


Homo sapiens, clone IMAGE:3530123, mRNA, 


10 


132898 


W28548 


Hs.224829 


ESTs 


10 


132905 


NM_004235 


Hs.7934 


Kruppel-like factor 4 (gut) 


10 


133120 


NM_003278 


Hs.65424 


tetranectin (plasminogen-binding protein 


10 


133407 


AF017987 


Hs.7306 


secreted frizzled-related protein 1 


10 


133719 


H26904 


Hs.75736 


apolipoprotein D 


10 


134007 


AF072441 


Hs.7840 


calcineurin binding protein 1 


10 


134055 


D86062 


Hs.182423 


ES1 (zebrafish) protein, human homolog o 


10 


134111 


AI372588 


Hs.8022 


TU3A protein 


5 


134117 


AA081846 


Hs.7921 


Homo sapiens mRNA; cDNA DKFZp566E183 (fr 


5 


134177 


BE243319 


Hs.79672 


KIAA0652 gene product 


10 


134369 


AF207664 


Hs.8230 


a disintegrin-like and metalloprotease ( 


10 


134496 


M64936 




gb:Homo sapiens retinoic acid-inducible 


10 


134510 


NM_002757 


Hs.250870 


mitogen-activated protein kinase kinase 


10 


134550 


M26315 


Hs.85258 


CD8 antigen, alpha polypeptide (p32) 


5 


134758 


NM_000078 


Hs.89538 


cholesteryl ester transfer protein, plas 


5 


134963 


NM_003394 


Hs.91985 


wingless-type MMTV integration site fami 


10 


135066 


X04430 


Hs.93913 


interleukin 6 (interferon, beta 2) 


10 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, type 2 


10 


446674 


AA563892 


Hs.306000 


solute carrier family 4 (anion exchanger), memb 


10 



TABLE 2A 

Table 2A shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 
103747 117944_1 AA081995 AA101099 

134496 46501_1 M64936 AI025512 AI3829a7 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311 
AI884569 AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 

105239 34624_1 AA221036 R87170 BE537068 BE544757 018935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE000620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 
016859 

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 
X51363 X51364 X51365 

102208 6736 9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
AI209170 AI186653 AI127795 AI183846 H77389 AI589455 AA629390 H94306 AI018388 R68584 AA027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
N70806 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388 

T85706 AF075298 AI110799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 
AI133290 AI133304 AI174948 AI207484 AI110717 AF074624 AI114515 AF063516 AI110542 AI114559 AI114498 AI114759 
AI207568 AI064960 AI174753 AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701 
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 

AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979 
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 
AA332728 T51362 AI114589 R06691 AI110629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106 

R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567 
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AI114786 
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675 

AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AI913646 
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289 
AA046980 AI823482 AI114535 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 

AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318 
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330 
W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158 
T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T88447 D11809 D12412 T64300 
T28321 T55864 

102800 1478^ AA313538 U88895 U88902 

108351 genbank_AA071193 AA071193 

101447 entrez_M21305 M21305 
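
The row layout of Table 2A (a Pkey, a CAT number, then a run of accession numbers that may wrap onto continuation lines) can be read mechanically. The following is a minimal sketch only, assuming that every Pkey is a six-digit number at the start of a line and that continuation lines contain accessions alone; the function name `parse_cluster_table` and the abbreviated sample rows are illustrative and are not part of the original disclosure.

```python
import re

# Assumption: a data row starts with a six-digit Pkey, then the CAT number,
# then accessions. Continuation lines hold accessions only (GenBank
# accessions begin with a letter, so they never match this pattern).
ROW_START = re.compile(r"^(\d{6})\s+(\S+)\s+(.+)$")


def parse_cluster_table(lines):
    """Return {pkey: {"cat": str, "accessions": [str, ...]}} from table lines."""
    clusters = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines between rows
        match = ROW_START.match(line)
        if match:
            pkey, cat, rest = match.groups()
            current = clusters.setdefault(pkey, {"cat": cat, "accessions": []})
            current["accessions"].extend(rest.split())
        elif current is not None:
            # continuation line: more accessions for the current Pkey
            current["accessions"].extend(line.split())
    return clusters


# Abbreviated sample: only the first few accessions of two rows are shown.
sample_lines = [
    "111168 38585_1 AI798376 S46400 AW811617",
    "AA055556 AW858231",
    "",
    "134496 46501_1 M64936",
]
clusters = parse_cluster_table(sample_lines)
print(clusters["111168"]["cat"], len(clusters["111168"]["accessions"]))
```

A dictionary keyed by Pkey is used here so that the cluster for a probeset listed without a UnigeneID in Table 2 can be looked up directly.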
TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigeneID Unigene Title R1 

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10.0 
102208 U22961 gb:Human mRNA clone with similarity to L 10.0 
102990 AA829286 Hs.332053 serum amyloid A1 10.0 
111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 10.0 
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10.0 
130085 M62402 Hs.274313 insulin-like growth factor binding prote 10.0 
130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10.0 
131543 AW966881 Hs.41639 programmed cell death 2 10.0 
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10.0 
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10.0 
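
Table 3A covers exactly those Table 3 probesets whose rows carry a "gb:" description in place of a UnigeneID. As a hedged illustration of how such Pkeys could be picked out, the sketch below filters a few Table 3 rows; the tuple layout, the helper name `pkeys_without_unigene`, and the use of `None` for a missing UnigeneID are assumptions made for this example, and the Pkey 111168 is written as it appears in Table 3A.

```python
import re

# Assumption: a UnigeneID has the form "Hs." followed by digits.
UNIGENE_ID = re.compile(r"^Hs\.\d+$")


def pkeys_without_unigene(rows):
    """Return the Pkeys of rows whose UnigeneID field is missing or malformed.

    Each row is a tuple: (pkey, exaccn, unigene_id_or_None, title, r1).
    """
    missing = []
    for pkey, _exaccn, unigene_id, _title, _r1 in rows:
        if unigene_id is None or not UNIGENE_ID.match(unigene_id):
            missing.append(pkey)
    return missing


# A few rows transcribed from Table 3; None marks an absent UnigeneID.
table3_rows = [
    ("101336", "NM_006732", "Hs.75678", "FBJ murine osteosarcoma viral oncogene h", 10.0),
    ("102208", "U22961", None, "gb:Human mRNA clone with similarity to L", 10.0),
    ("102990", "AA829286", "Hs.332053", "serum amyloid A1", 10.0),
    ("111168", "AI798376", None, "gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens", 10.0),
]
missing = pkeys_without_unigene(table3_rows)
print(missing)
```

The two Pkeys returned for this sample are the same two probesets for which Table 3A lists cluster accessions.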
TABLE 3A 

Table 3A shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

102208 6735_1 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 AI685092 
AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 N70806 AI141254 
AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 AA877103 W84464 
AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 AI133272 V00494 M12523 
M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388 T85706 AF075298 AI110799 
D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 AI133290 AI133304 AI174948 
AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759 AI207568 AI064960 AI174753 
AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 H94399 AI133158 T74675 AA484750 
T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701 AI174748 AI114663 AI133104 AI132999 
AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 AI444620 T52290 D16931 T40012 T48403 
T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979 N56507 T87485 AI133622 AA343370 T40075 
T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 AA332728 T51362 AI114589 R06691 AI110629 
AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 AI114840 T57284 T39981 T61407 T72757 T51749 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 AI132926 R09237 AI064838 AI133660 T60398 T88753 
T55930 T92126 AI444502 T60996 AI114792 H93911 AI133106 R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 
AI133040 AI133017 AI064857 AI110730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 AA334224 AA334281 
R06692 T64739 T40163 T60628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 
AI065049 T84512 T55918 AI207595 T39951 AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 
AI064778 AA035710 W52763 AI114786 T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 
AI110809 R28984 AI174854 AA305675 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 
AF075308 W86731 T82851 T48269 H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 
R02576 T95666 AI203974 AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 
AI133117 AI766232 AI913646 T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936364 
AI114720 AI433289 AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 
T78943 AI823481 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 
T67411 T68318 AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 
AA025050 T73330 W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T51811 
T57232 AI336158 T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 
T67708 T70918 T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 
T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 
AI312890 T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 
T64300 T28321 T55864 
TABLE 4: Figure 4 from BRCA 001 US 

Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of tumor to normal breast tissue 

Pkey ExAccn UnigeneID Unigene Title R1 

100113 NM_001269 Hs.84746 chromosome condensation 1 2.3 
100114 X02308 Hs.82962 thymidylate synthetase 2.9 
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodiesterase 1 1.9 
100146 BE185499 Hs.2471 KIAA0020 gene product 1.9 
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin I-like) (periostin) 7.5 
100154 H50720 Hs.81892 KIAA0101 gene product 9.2 
100163 W44671 Hs.124 gene predicted from cDNA with a complete coding sequence 1.6 
100220 AW015534 Hs.217493 annexin A2 2.0 
100265 D38521 Hs.112396 KIAA0077 protein 1.5 
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgizzarin) 13.5 
100275 BE242802 Hs.154797 KIAA0090 protein 5.1 
100323 D50920 Hs.23106 KIAA0130 gene product 1.9 
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrolase, isoform Ib, gamma subunit (29kD) 2.7 
100354 NM_004341 Hs.154868 carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase 2.0 
100372 NM_014791 Hs.184339 KIAA0175 gene product 2.6 
100393 D84145 Hs.39913 novel RGD-containing protein 32 
100400 AW954324 Hs.75790 phosphatidylinositol glycan, class C 1.5 
100418 D86978 Hs.84790 KIAA0225 protein 2.0 
100482 M65028 Hs.81361 heterogeneous nuclear ribonucleoprotein A/B 2.9 
100518 NM_004415 Hs.74316 desmoplakin (DPI, DPII) 1.9 
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 5.7 
100667 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 9.0 
100668 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 7.6 
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 
100685 AA328229 Hs.184562 ribosomal protein L24 1.8 
100690 AA383256 Hs.1657 estrogen receptor 1 16 
100783 AF078847 Hs.191356 general transcription factor IIH, polypeptide 2 (44kD subunit) 5.9 
100850 AA836472 Hs.297939 cathepsin B 1.7 
100892 BE245294 Hs.180789 S164 protein 1.7 
100945 AF002225 Hs.180686 ubiquitin protein ligase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 1.5 
100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial carrier; adenine nucleotide translocator), member 5 6.3 
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 
100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 1.6 
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase) 8.2 
101045 J05614 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0 
101077 N99692 Hs.75227 Empirically selected from AFFX single probeset 2.6 
101093 L06419 Hs.75093 procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Danlos syndrome type VI) 1.4 
101161 NM_006262 Hs.37044 peripherin 16.9 
101186 AA020956 Hs.179881 core-binding factor, beta subunit 2.0 
101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase) 1.6 
101228 AA333387 Hs.82916 chaperonin containing TCP1, subunit 6A (zeta 1) 1.7 
101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 1.9 
101249 L18964 Hs.1904 protein kinase C, iota 1.5 
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 5.2 
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 3.4 
101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxidase assembly protein 6.3 
101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2 
101445 M21259 gb:Human Alu repeats in the region 5' to the small nuclear rib 1.9 
101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 1.6 
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activating protein) 1 2.5 
101483 M24486 Hs.76768 procollagen-proline, 2-oxoglutarate 4-dioxygenase (pro^^^ 5.5 
101540 J04977 Hs.84981 X-ray repair complementing defective repair in Chinese hams^^ 

101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subunit, ATPase, 3 1.6 
101580 NM_012151 Hs.83363 coagulation factor VIII-associated (intronic transcript) 57 
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 1.8 
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 
101621 BE391804 Hs.62661 guanylate binding protein 1, interferon-inducible, 67kD 2.4 
101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha isoform 1.3 
101734 M74099 Hs.147049 cut (Drosophila)-like 1 (CCAAT displacement protein) 21 
101759 M80244 Hs.184601 solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 5.0 
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 
101782 AA30$495 Hs.1869 phosphoglucomutase 1 5.2 
101805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein) 8.6 
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psoriasin 1) 8.9 
101810 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 3.2 
101879 AA176374 Hs.243886 nuclear autoantigenic sperm protein (histone-binding) 1.6 
101911 AA441787 Hs.119689 glycoprotein hormones, alpha polypeptide 31.3 
101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II 18 
101973 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylg^dosamui^^ 2.4 
101983 AI904232 Hs.75323 prohibitin 8.4 
102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 
102036 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevisiae, homolog) 2.0 
102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 45kD 1.6 
102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 1.3 
102107 BE258602 Hs.182366 heat shock protein 75 1.4 
102123 NM_001809 Hs.1594 centromere protein A (17kD) 1.8 
102165 BE313280 Hs.159627 death associated protein 3 4.6 
102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit (50kD) 4.3 
102217 AA829978 Hs.301613 JTV1 gene 67 
102220 U24389 Hs.65436 lysosomal 4.3 
102234 AW163390 Hs.278554 heterochromatin-like protein 1 1.9 
102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) 4.4 
102302 AA306342 Hs.69171 protein kinase C-like 2 27 
102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta) 15 
102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 

102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 2.0 
102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 3.2 
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 2.0 
102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 
102391 AA296874 Hs.77494 deoxyguanosine kinase 1.5 
102455 U48705 Hs.75562 discoidin domain receptor family, member 1 5.9 
102465 NM_001359 Hs.81548 2,4-dienoyl CoA reductase 1, mitochondrial 1.8 
102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1, 59kD 1.5 
102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (yeast homolog)-like 3.3 
102494 AI188137 Hs.75193 COP9 homolog 2.1 
102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting repressor; pyrimidine tract binding splicing 3.2 
102522 BE250944 Hs.183556 solute carrier family 1 (neutral amino acid transporter), member 5 2.8 
102532 AF040253 Hs.70186 suppressor of Ty (S. cerevisiae) 5 homolog 57 
102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Drosophila) homolog 1 2.3 
102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3 
102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 1 21 
102581 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 1.6 
102582 U61232 Hs.32675 tubulin-specific chaperone e 21 
102617 AW161453 Hs.198767 COP9 (constitutive photomorphogenic, Arabidopsis, homolog) subunit 5 1.8 
102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 
102627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) 1.3 
102663 NM_002270 Hs.168075 karyopherin (importin) beta 2 18 
102676 BE262989 Hs.12045 putative protein 2.3 
102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 43 
102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, type II 6.0 
102696 BE540274 Hs.239 forkhead box M1 4.2 
102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 
102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 23 
102750 AB014460 Hs.66196 nth (E. coli endonuclease III)-like 1 12 
102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase 6.4 
102812 U90549 Hs.236774 high-mobility group (nonhistone chromosomal) protein 17-like 3 1.6 
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (beta) 5.6 
102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 1 subunit 2.0 
102644 AV653790 Hs.324275 WW domain-containing protein 1 1.3 

102868 X02419 Hs.77274 plasminogen activator, urokinase 4.4 
102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 
102935 BE561850 Hs.80506 small nuclear ribonucleoprotein polypeptide A' 2.4 
102968 AU076611 Hs.154672 methylene tetrahydrofolate dehydrogenase (NAD+ dependent), methenyltetrahydrofote^^ 
102983 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) expressed in 3.1 
102985 U95742 Hs.2707 G1 to S phase transition 1 5.2 
103023 AW500470 Hs.117950 multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 1.6 
103038 AA926960 Hs.334883 CDC28 protein kinase 1 2.5 
103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stromelysin 3) 4.5 
103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatosis 1) 3.1 
103089 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2.4 
103177 BE244377 Hs.48876 farnesyl-diphosphate farnesyltransferase 1 3.5 
103178 AA205475 Hs.275865 ribosomal protein S18 9.9 
103179 NM_001777 Hs.82685 CD47 antigen (Rh-related antigen, integrin-associated signal transducer) 1.3 
103181 X69636 Hs.334731 Homo sapiens, clone IMAGE:3448306, mRNA, partial cds 2.0 
103185 NM_006825 Hs.74368 transmembrane protein (63kD), endoplasmic reticulum/Golgi interme t6 
103191 AA401039 Hs.2903 protein phosphatase 4 (formerly X), catalytic subunit 2.5 
103193 NM_004766 Hs.75724 coatomer protein complex, subunit beta 2 (beta prime) 2.2 
103194 NM_004939 Hs.78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 6.3 
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (gamma) 3.0 
103232 X75962 Hs.129780 tumor necrosis factor receptor superfamily, member 4 1.8 
103238 AI359285 Hs.75189 death-associated protein 5.6 
103297 NM_001545 Hs.9078 immature colon carcinoma transcript 1 1.9 
103330 AI803447 Hs.77496 small nuclear ribonucleoprotein polypeptide G 2.5 
103349 X89059 gb:H.sapiens mRNA for unknown protein expressed in macrophage 1.6 
103376 AL036166 Hs.323378 coated vesicle membrane protein 1.8 
103391 X94453 Hs.114366 pyrroline-5-carboxylate synthetase (glutamate gamma-semialdehyde synthetase) 23 
103392 X94563 gb:H.sapiens dbi/acbp gene exon 1 & 2. 4.0 
103430 BE564090 Hs.20716 translocase of inner mitochondrial membrane 17 (yeast) homolog A 1.3 
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6 

103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer-binding protein 2 beta) 5.1 
103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7) 97 
103588 NM_006218 Hs.85701 phosphoinositide-3-kinase, catalytic, alpha polypeptide 2.0 
103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (campomelic dysplasia, autosomal sex-reversal) 1.3 
103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 2.0 
103622 AA609685 Hs.278672 membrane component, chromosome 11, surface marker 1 2.3 
103727 AI878883 Hs.296381 growth factor receptor-bound protein 2 1.3 
103749 AL135301 Hs.8768 hypothetical protein FLJ10849 1.8 
103754 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (from clone DKFZp586I2022) 1.3 
103780 AA094752 Hs.169992 hypothetical 43.2 Kd protein 7.5 
103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds 1.2 
103797 AA0fl0912 gb2n04d03.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone 5' similar 1.5 
103813 AI042582 Hs.181271 CGI-120 protein 1.5 
103855 W02363 Hs.302267 hypothetical protein FLJ10330 1.5 
103866 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to constitutive photomorphog^ 6.5 
104052 NM_002407 Hs.97644 mammaglobin 2 2.9 
104079 AA251242 Hs.103238 ESTs 1.4 
104174 AA478984 Hs.6451 PRO0659 protein 5.6 
104227 AB002343 Hs.98938 protocadherin alpha 9 1.6 
104275 AI751970 Hs.101067 GCN5 (general control of amino-acid synthesis, yeast, homolog)-like 2 5.4 
104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 6.3 
104370 AA324597 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 1.6 
104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5.2 
104482 AB037762 Hs.44268 myelin gene expression factor 2 1J2 
104532 AI498763 Hs.203013 hypothetical protein FLJ12748 2A 
104563 AL117403 Hs.306189 DKFZP434F1735 protein 1^ 
104667 AI239923 Hs.30098 ESTs 1.3 
104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily I, member 6 2.3 
104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapiens] 1.3 
104806 AB023175 Hs.22982 KIAA0958 protein 2.3 
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9 
104846 AI250789 Hs.32478 ESTs 5.6 
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kinase-1 123 
104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especia^ 2.0 
104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp66402364 (from clone DK^ 1.3 
104896 AW015318 Hs.23165 ESTs 177 

104909 AW408164 Hs.249184 transcription factor 19 (SC1) 5.0 
104916 AW958157 Hs.155489 NS1-associated protein 1 1.7 
104919 AA026880 Hsi5252 prolactin receptor 1.4 
104930 AF043467 Hs.32893 neurexophilin 2 2.2 
104973 NM_015310 Hs.6763 KIAA0942 protein 5.0 
104974 Y12059 Hs.278675 bromodomain-containing 4 1.4 
104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 2.4 
104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromoso 2.3 
104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010317E24 gene, clone IMAGE:3502019, mRNA, partial cds 7.2 
104979 AA937934 Hs.321062 ESTs 1.3 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 3.5 
105009 BE379584 Hs.34789 dolichyl-diphosphooligosaccharide-protein glycosyltransferase 5.5 
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 33 
105028 AI050715 Hs.2331 E2F transcription factor 5, p130-binding 22 
105032 AA127818 gb3d12a02.s1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:501674 3' 6.8 
105039 AA907305 Hs.36475 ESTs 2.5 
105041 AB037716 Hs.26204 KIAA1295 protein 2.2 
105045 BE242899 Hs.129951 speckle-type POZ protein 3.8 
105079 AA151342 Hs.12677 CGI-147 protein 9.5 
105087 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HEMBA1002716 5.6 
105088 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PLACE1004114 2.2 
105095 Z78407 Hs.27023 vesicle transport-related protein 22 
105110 BE387350 Hs.33122 KIAA1160 protein 1.6 
105126 AW975433 Hs.36288 ESTs 6.3 
105127 AA045648 Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 2.1 
105141 AA164687 Hs.177576 mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferase/ 2.7 
105158 AW976357 Hs.234545 hypothetical protein NUF2R 1.9 
105169 BE245294 Hs.180789 S164 protein 1.7 
105186 AA191512 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PLACE1010076 4.8 

105254 AA071276 Hs.19469 KIAA0859prot^ 1.9 

105281 AA263143 Hs.24596 RADSI-interacting protein 2.8 

35 105268 N99673 Hs.3585 ESTs, Weakly simBar to AF126743 1 DNAJdomain-a)ntaining protein MCJ [H.sapiera^ 1.9 

105302 AA700122 Hs.3355 sentrin-spedfic protease 8.0 

105331 AW270037 Hs.179507 KIAA0779 protein 1-8 

105359 Nt\L016015Hs.8054 CGI-68 protein 8.2 

105356 BE264645 Hs.282093 hypothetical protein FLJ219 IB 5.0 

40 105373 AW887701 Hs.32356 hypothetical protein aJ20628 2.5 

105374 BE242803 Hs.262823 hypothetical protein FU10326 2.2 

105387 AW592146 Hs.108636 membrane protein CHI 23 

105393 AF167570 Hs.256583 interieukin enhancer binding factor 3. 90kD 5.4 

105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 1.6 

45 105400 AF198620 Hs.65648 RNA binding nwtif protein BA 1.6 
105445 AA252395 gb:zs12g10.s1 NCI CGAP.GCB1 Homo sapiens dDNA done IMAGE«65026 3', mRNA sequence. 5.0 

105507 BE268348 Hs.226318 CCR44^0Ttranscriptk)n complex, subunit 7 1.6 

105529 AA1 13449 Hs.32471 hypothetical protein FU20364 1.3 

105530 AB023179 Hs.9059 KIAA0962 protein 3.4 
50 105547 AA262640 Hs.27445 unknown 9.3 

105564 BE616694 Hs.288042 hypothetical protein FU 14299 1.4 

105596 AA579535 Hs.18490 hypothetical protein FU20452 10.9 

105597 AF054284 Hs.334826 splicing factor 3b» subunH 1. 155kD 2.9 
105608 AI808201 Hs.287863 hypothetteal protein FU 12475 1.7 

55 105610 AA280072 Hs.99872 fetel Alzheimer antigen ' 1.4 

105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 1.7 

105620 AW302245 Hs,18l390 casein kinase 1. gamma 2 5.5 

105658 AA985190 Hs.246875 hypothetical protein aJ20059 9.4 

105697 AW49998B Hs.27801 zinc finger protein 278 2.0 

60 105708 R26944 Hs.180777 Homo sapiens mRNA; cDNADKFZp564M0264 (from ctoneDKFZp564M0264) 1.7 

105743 BE246502 Hs.9598 soma domain, immunogtobulin domain (tg). transmembrane domain (TM) and short 2.6 
105746 AW151952 Hs.46679 hypothetk^al protein FU20739 1.5 
106759 AI123116 Hs.15159 chemokine-like factor, altematlvely spliced 1.3 
105771 AI267720 Hs.153221 synovialsarcoma, translocated to X chromosome 1-6 

105820 AA741336 Hs.152108 transcriptional unit N143 2.2
105826 AA478756 Hs.194477 E3 ubiquitin ligase SMURF2 1.3
105856 AI262106 Hs.12653 ESTs 2.4
105856 AF151066 Hs.281428 hypothetical protein 2.9

105875 AK001708 Hs.32271 hypothetical protein FLJ10846 1.4

105930 AF018371 Hs.9880 peptidyl prolyl isomerase H (cyclophilin H) 5.2

106000 AW194426 Hs.20726 ESTs 1.7

106011 AW081202 Hs.12284 Homo sapiens, done lMAGEa9B9556. mRNA, partial cds 2.8 

106017 AA477956 Hs.26268 ESTs 1.4 

106073 AL157441 Hs.17834 downstream neighbor of SON 1.4
106078 AA130158 Hs.19977 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6

106094 AA533491 Hs.23317 hypothetical protein FLJ14681 6.8

106140 AB005624 Hs.14912 KIAA0286 protein 1.6
106271 AA251393 Hs.289052 Homo sapiens, Similar to RIKEN cDNA 5430429M05 gene, clone MGC:13155, mRNA, complete cds 10.8

106288 AB037742 Hs.24336 KIAA1321 protein 1.3 

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromosomal) protein 4 3.6

106333 AL043114 Hs.22410 ESTs, Weakly similar to A54849 collagen alpha 1(VII) chain precursor [H.sapiens] 5.4

106350 AK001404 Hs.194698 cyclin B2 5.7

106359 AW390282 Hs^il30 transmembrane 7 superfamily member 2 6.3

106381 AB040916 Hs.24106 KIAA1483 protein 6.5 

106389 AW748420 Hs.6236 Homo sapiens cDNA: FLJ21487 fis, clone COL05419 2.2

106457 AF119256 Hs.27801 zinc finger protein 278 2.7

106470 D63078 Hs.186180 Homo sapiens cDNA: FLJ23038 fis, clone LNG02039 2.3

106531 AA454036 Hs.8832 ESTs 1.6 

106586 AA243837 Hs.57787 ESTs 1.6 

106589 AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 fis, clone HEMBA1001702 2.4

106610 AA45BB82 Hs.79732 fibulin 1 7.9

106624 NM_003595 Hs.26350 tyrosylprotein sulfotransferase 2 7.7

106550 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from clone DKFZp564O0122) 1.6

106669 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] U

106713 BE614802 Hs.184352 hypothetical protein FLJ12549 4.5

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 1.3 

106723 BE3880g4 Hs.21857 ESTs 1.6

106795 AF174487 Hsi93753 Bcl-2-related ovarian killer protein-like 5.7

106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to ARL-6 interacting protein-2 16.2

106831 BE564871 Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast homolog) 1.5

106846 AB037744 Hs.34892 KIAA1323 protein 2.2

106852 AF151031 Hs.300631 hypothetical protein 1.3 

106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, partial cds 16.8

106886 W79171 Hs.9557 GL002 protein 1.5 

106905 AA861271 Hs.222024 transcription factor BMAL2 2.2

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3.3

106945 AK000511 Hs.6294 hypothetical protein DKFZp434L1436 similar to valyl tRNA synthetase 6.8

106973 BE156256 Hs.11923 hypothetical protein 6.6 

106977 AL043152 Hs.50421 KIAA0203 gene product 4.8

106978 AW631480 Hs.8688 ESTs 6.0 
107004 AA146872 Hs.300700 hypothetical protein FLJ20727 1.3 
107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 1.8 
107071 AW385224 Hs.35198 ectonucleotide pyrophosphatase/phosphodiesterase 5 (putative function) 1.7
107113 AK000733 Hs.239W GTPase activating protein 2.5
107125 AK000512 Hs.69388 hypothetical protein FLJ20505 17
107136 AV661958 Hs.8207 GK001 protein 4.6
107136 AV651958 Hs.8207 GK001 protein 3.3
107146 AK001455 Hs.5198 Down syndrome critical region gene 2 2.0
107151 AW378065 HS.B687 ESTs 6.3 
107155 AW391927 Hs.7946 KIAA1288 protein 33.5
107174 BE122762 Hs.25338 ESTs 5.2 
107197 W15477 Hs.64639 glioma pathogenesis-related protein 6.1
107221 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 (stathmin) 17.4
107243 BE219716 Hs.34727 ESTs, Moderately similar to 138759 zinc finger/leucine zipper protein [H.sapiens] 7.4
107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 complex subunit 1.8
107263 060341 Hs.21198 translocase of outer mitochondrial membrane 70 (yeast) homolog A 6.6
107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.5
107298 N95657 Hs.6820 ESTs, Moderately similar to YOJI^CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJIJ^AEELH 17

107299 BE277457 Hs.30661 hypothetical protein MGC4606 35 
107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp58610324 (from clone DKFZp58610324) 2.0
107354 NM_006299 Hs.96448 zinc finger protein 193 5.0
107392 AW299900 Hsi67632 TATA element modulatory factor 1 1.2
107481 AA307703 Hs.279766 kinesin family member 4A 1.6 
107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 3.0

107554 AA001386 Hs.59844 ESTs 1.3
107681 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.2

107772 AA0185B7 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1

107859 AW732573 HS.475B4 potassium voltage-gated channel, delayed-rectifier, subfamily S, me 8.4

107901 L42612 Hs.335952 keratin 6B 2.5 

107901 L42612 Hs.335952 keratin 6B 1.6

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 2.2 

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 1 6.7 

108040 AL121031 Hs.159971 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, 1.5

108230 AA054224 Hs.59847 ESTs 1.3 

108274 AF129535 Hs.272027 F-box only protein 5 7.1 

106296 N31256 Hs.161623 ESTs 2.5

108496 AA083069 Hs.339659 ESTs 3.5 

108607 BE300380 Hs.69476 Homo sapiens cDNA FLJ12756 fis, clone NT2RP2001326 3.4

108621 AA101809 Hs.182685 ESTs 1.6 

108634 AW022410 Hs.69507 ESTs 17 

108647 BE546947 Hs.44276 homeobox C10 9.8

108695 AB029000 Hs.70823 KIAA1077 protein 7.2

108717 AA122393 Hs.70811 hypothetical protein FLJ20516 1.3

108740 AI089575 Hs.9071 progesterone membrane binding protein 27

108828 AK001693 Hs.273344 DKFZP564O0463 protein 1.8

108859 AL121500 Hs.178904 ESTs 1.5

108872 H06720 Hs.111680 endosulfine alpha 2.1

108891 AI801235 Hs.48480 ESTs 5.3

108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4.0

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 5.6

108982 AA151708 HS.1719B0 homeo box (expressed in ES cells) 1 1.6

108987 AA152178 Hs.23467 hypothetical protein FLJ10633 6.2 

109002 AB028987 Hs.72134 KIAA1064 protein 1.7

109011 AA156542 Hs.72127 ESTs 1.4
109026 AA157811 gb:zo35d07.s1 Stratagene colon (937204) Homo sapiens cDNA clone 3' similar to contains Alu repetitive 5.3

109068 AA164293 Hs.72545 ESTs 2.9 

109101 AW608930 Hs.52184 hypothetical protein FLJ20618 1.6

109112 AW419196 Hs.257924 hypothetical protein FLJ13782 3.2

109124 AK000684 Hs.183887 hypothetical protein FLJ22104 1.7

109139 AJ132592 Hs.59757 zinc finger protein 281 2.6 

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkinesin 6) 2.9

109198 BE566742 Hs.58169 highly expressed in cancer, rich in leucine heptad repeats 2.0

109213 NM_016603 Hs.82035 potential nuclear protein C5ORF5; GAP-like protein 5.3

109220 AW958181 Hs.189998 ESTs 57 

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN) 5.3

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.4

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (from clone DKFZp586F1822) 2.9

109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein 1.3

109341 AA213506 Hs.115099 EST 2.9

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 1.5

109420 H83603 Hs.40408 homeobox C9 2.2

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subunit 6 3.0

109429 AI160029 Hs.61438 ESTs 1.9

109445 AA232103 Hs.189915 ESTs 1.8 

109450 AB032969 Hs.173042 KIAA1143 protein 37

109468 NM_015310 Hs.6763 KIAA0942 protein 3.2

109478 AW074143 Hs.87134 ESTs 2.0

109570 L40027 Hs.118890 glycogen synthase kinase 3 alpha 2.1

109662 F02614 Hs.27319 ESTs 1.4 

109825 R71264 Hs.16798 ESTs 13 

110039 H11938 Hs.21907 histone acetyltransferase 2X}

110056 AA503041 Hs.279009 matrix Gla protein 2.5 

110085 AA603840 Hs.29956 KIAA0460 protein 17

110110 T07353 Hs.7948 ESTs 2.9

110129 R51853 Hs.226429 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 17

110154 NM_014521 Hs.17667 SH3-domain binding protein 4 4.2
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOCHROME P450 4A11 PRECURSOR [H.sapiens] 4.2

110242 N41744 Hs.19978 CGI-30 protein 1.3

110259 H28428 Hs.32406 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 2.2

110312 BE256986 Hs.11d96 hypothetical protein FLJ12089 2.1 
110330 AI288666 Hs.16621 DKFZP434I116 protein 6.2

110501 H55748 gb:yq94a01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:203400 3' 6.1

110504 H55915 Hs.210859 hypothetical protein FLJ11016 6.1

110525 H57330 Hs.37430 EST 6.3 

110568 AK001160 Hs.5999 hypothetical protein FLJ10298 1.3

110699 T97586 Hs.18090 ESTs 1.8

110705 AB007902 Hs.32168 KIAA0442 protein 1.6 

110742 AW190338 Hs.28029 hypothetical protein MGC11256 7.6

110761 AL138077 Hs.16157 hypothetical protein FLJ12707 2.5

110762 BE044245 Hs.30011 hypothetical protein MGC2963 9^ 
110765 AK000322 Hs.18457 hypothetical protein FLJ20315 5.5
110769 BE000831 H3.23a37 Homo sapiens cDNA FLJ11812 fis, clone HEMBA1006364 2.1
110799 AI089660 Hs.323401 dpy-30-like protein 1.5
110805 T25829 Hs.24048 FK506 binding protein precursor 6.6 
110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 57
110820 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 3.4
110840 N31598 Hs.12727 hypothetical protein FLJ21610 1.7 
110844 AI740792 Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2 (beta) 17
110854 BE612992 Hs.27931 hypothetical protein FLJ10607 similar to glucosamine-phosphate N-acetyltransferase 47
110856 AA992380 gb:ot37g06.s1 Soares_testis_NHT Homo sapiens cDNA clone 7 similar to contains element 2.3
110885 BE384447 Hs.16034 hypothetical protein MGC13186 3.5
110897 AL117430 Hs.6880 DKFZP434D156 protein 2.2
110915 BE092285 Hs.29724 hypothetical protein FLJ13187 2.6
110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expression in cancer [H.sapiens] 1.9
110958 NM_005864 Hs.24587 signal transduction protein (SH3 containing) 67
110963 AK002180 Hs.11449 DKFZP5640123 protein ZO 
110981 AK001980 Hs.24284 ADP-ribosyltransferase (NAD+; poly(ADP-ribose) polymerase)-like 2 13
110984 AW613287 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 1 ( 1.8
111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 3.6
111132 AB037807 Hs.83293 hypothetical protein 2.1 
111164 N46180 Hs.122489 Homo sapiens cDNA FLJ13289 fis, clone OVARC1001170 2.3
111172 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 37
111174 AL050166 Hs.26295 Homo sapiens mRNA; cDNA DKFZp586D1122 (from clone DKFZp586D1122) 7.5
111179 AK000136 Hs.10760 asporin (LRR class 1) 7.1
111184 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HEP08257 67
111184 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.3
111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse transcriptase homolog [H.sapiens] 3.6
111216 AW139408 Hs.152940 ESTs 1.5 
111221 AB037782 Hs.15119 KIAA1361 protein 2.6 
111223 AA852773 Hs.334838 KIAA1866 protein 4.6 
111239 N90956 Hs.17230 hypothetical protein FLJ22087 7.9
111285 AA778711 Hs.4310 eukaryotic translation initiation factor 1A 6.9
111299 AB033091 Hs.74313 KIAA1265 protein 5.0 
111312 AI523913 Hs.34504 ESTs 3.8 
111318 T99755 Hs.334728 ESTs 1.2 
111337 AA837396 Hs.263925 LIS1-interacting protein NUDE1, rat homolog 5.1
111352 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PLACE1004114 2.2
111370 AI478658 Hs.94631 brefeldin A-inhibited guanine nucleotide-exchange protein 1 2.8
111384 N94606 Hs.288969 HSCARG protein 2.2 
111389 AK000987 Hs.169111 oxidation resistance 1 2.1

111391 NM_003896 Hs.225939 sialyltransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 synthase) 5.1

111392 W46342 Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA, partial cds 8.4
111452 R02354 Hs.15999 ESTs 27 
111486 AI051194 Hs.22797d EST 6.5 
111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF1_HUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1.4
111585 R10720 Hs.20670 EST 1.6 
111627 R52656 Hs.21691 ESTs 1.6 
111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein, partial cds 2.4 
111937 BE298655 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (from clone DKFZp564D016) 10.6
111944 AW083791 Hs.21263 suppressor of potassium transport defect 3 6.6
111987 NM_015310 Hs.6763 KIAA0942 protein 5.1
112134 R41823 Hs.7413 ESTs; calsyntenin-2 2.8 
112244 AB029000 Hs.70823 KIAA1077 protein 14.6
112388 R46071 Hs.301693 Homo sapiens, clone IMAGE:3638994, mRNA, partial cds 9.0
112456 NM_016248 Hs.232076 A kinase (PRKA) anchor protein 11 1.4
112464 AW007287 Hs.28538 Homo sapiens cDNA: FLJ21086 fis, clone CAS03272 1.4
112506 AI742756 Hs.26079 ESTs 3.2
112513 R68425 Hs.13B09 hypothetical protein FLJ10648 2.0

112752 AK001635 HS.14B38 hypothetical protein FLJ10773 1.8 

112884 AK000004 Hs.5013 Homo sapiens mRNA for FLJ00004 protein, partial cds 6.6

112923 T10258 Hs.5037 EST 1.5 

112936 AW970826 Hs.6185 KIAA1557 protein 3.2

112958 R61388 Hs.6724 ESTs 6.0 

112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding factor 1 6.4

112978 AK000272 Hs.7099 hypothetical protein FLJ20265 1.2

112995 AA737033 Hs.7155 ESTs, Moderately similar to 2115357A TYKi protein [M.musculus] 5.6

112996 BE276112 Hs.7165 zinc finger protein 259 2.0 
113047 AI571940 Hs.7549 ESTs 1.9
113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein, partial cds 2.4
113089 T40707 Hs.270862 ESTs 1.3 
113196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cDNA clone IMAGE:74668 3' 1.7
113248 T63857 gb.7cl6eOI.sl Stratagene lung (937210) Homo sapiens cDNA clone 3', mRNA sequence 2.8
113254 AK002180 Hs.11449 DKFZP5640123 protein 1.3
113277 AW971049 Hs.11774 protein (peptidyl-prolyl cis/trans isomerase) NIMA-interacting, 4 (parvulin) 3.2
113429 AA688021 Hs.179808 ESTs 1.2 
113499 AI467908 Hs.8882 ESTs 5.9 
113547 H59588 Hs.15233 ESTs 2.0 
113554 AW503990 Hs.142442 HP1-BP74 3.6 
113647 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MAMMA1000831 1.3
113702 T97307 gb:ye53h05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121497 3', 4.4
113722 AV653556 Hs.184411 albumin 1.3

113769 AW499665 Hs.9456 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 1.2

113777 BE26e947 Hs.10590 zinc finger protein 313 13.4 

113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 1.7 

113791 AI269096 Hs.135578 chitobiase, di-N-acetyl- 1.3

113808 W44735 Hs.9286 Homo sapiens cDNA: FLJ21278 fis, clone COL01832 3.3

113811 BE207480 Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone HEP09141 3.1

113817 H13325 Hs.332795 hypothetical protein DKFZp761017121 3.2 

113826 AW378212 Hs.24d09 hypothetical protein FLJ10826 2.3

113834 T26463 Hs.6059 EGF-containing fibulin-like extracellular matrix protein 2 11.3

113868 W57902 Hs.90744 proteasome (prosome, macropain) 26S subunit, non-ATPase, 11 2.7

113870 AL079314 Hs.16537 hypothetical protein, similar to (U06944) PRAJA1 6.1

113885 AW959486 Hs.21732 ESTs 6.6 

113923 AW953484 Hs.3849 hypothetical protein FLJ22041 similar to FK506 binding proteins 1.9

113989 W87544 Hs.26B828 ESTs 1.2 

114022 AI539519 Hs.120969 Homo sapiens cDNA FLJ11562 fis, clone HEMBA1003197 5.4

114030 AI825386 Hs.164478 hypothetical protein FLJ21939 similar to 5-azacytidine induced gene 2 9.4

114060 AB029551 Hs.7910 RING1 and YY1 binding protein 1.8

114196 AF017445 Hs.150926 fucose-1-phosphate guanylyltransferase 1.5

114226 AB028968 Hs.7989 KIAA1045 protein 1.8 

114253 BE149B66 Hs.14831 Homo sapiens, Similar to zinc finger protein 136 (clone pHZ-20), clone MGC:10647, mRNA, complete cds 2.3

114262 AL117518 Hs.3686 KIAA0978 protein 1.4

114275 AW515443 Hs.306117 KIAA0306 protein 15.8

114292 AI815395 Hs.184641 fatty acid desaturase 2 1.9

114309 AA332453 Hs.20824 CGI-85 protein 2.4 

114392 AA249590 Hs.100748 ESTs, Weakly similar to A28996 proline-rich protein M14 precursor - mouse [M.musculus] 1.8

114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425) 1.2 

114455 H37908 Hs.271616 ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE 5.5

114463 AL120247 Hs.40109 KIAA0872 protein 5.2 

114464 AI091713 Hs.106597 Homo sapiens, Similar to RIKEN cDNA 1110012M11 gene, clone IMAGE:3688605, mRNA, partial cds 1.2
114471 AA028074 Hs.104613 RP42 homolog 1.8
114480 BE056778 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 6 (GalNAc-T6) 13.4
114671 AA766268 Hs.266273 hypothetical protein FLJ13346 1.9
114698 AA476966 Hs.110857 polymerase (RNA) III (DNA directed) polypeptide K (12.3 kDa) 3.5
114730 AI373544 Hs.331328 intermediate filament protein syncoilin 3.8 
114767 AI859865 Hs.154443 minichromosome maintenance deficient (S. cerevisiae) 4 1.6
114774 AV656017 Hs.184325 CGI-76 protein 3.1 
114798 AA159181 Hs.54900 serologically defined colon cancer antigen 1 3.5 
114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.3 

114895 AA236177 Hs.76591 KIAA0887 protein 7.1 

114896 BE539101 Hs.5324 hypothetical protein 1.3 

114911 AA236672 gb:zt29f02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:723771 3', mRNA sequence. 1.5

114930 AA237022 Hs.188717 ESTs 2.0 

114938 AA242834 Hs.58384 ESTs 2.9 
114965 AI7338B1 Hs.72472 BMP-RIB 2.3

115023 AF102546 Hs.63931 dachshund (Drosophila) homolog 13

115038 AA252360 Hs.87968 toll-like receptor 9 16 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1913076 11.8

115062 AA253314 Hs.154103 LIM protein (similar to rat protein Idna 1.5

115117 AI670847 Hs.5324 hypothetical protein 1.5

115121 AI634549 Hs.8B155 ESTs 2.8 

115206 AW183695 Hs.186572 ESTs 2.5 

115221 AW365434 Hs.79741 hypothetical protein FLJ10116 1.5

115239 BE251328 HsJ3291 hypothetical protein FLJ10881 1.3

115242 AI368236 Hs.283732 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 1.4

115278 AK002163 Hs.301724 hypothetical protein FLJ11301 1.5

115285 AW972872 Hs.293736 ESTs 2.4 

115291 BE545072 Hs.122579 hypothetical protein FLJ10461 6^

115400 AI215069 Hs.89113 ESTs 6.6

115468 AA314349 Hs.48499 tumor antigen SLP-8p 7.4 

115471 AK001376 Hs.59346 hypothetical protein FLJ10514 1.4

115479 AW301608 Hs.278188 ESTs, Moderately similar to 154374 gene NF2 protein [H.sapiens] 4.0 

115496 AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1 16.3

115500 Y14443 Hs.88219 zinc finger protein 200 5.0

115553 AJ275986 HsJ1414 transcription factor (SMIF gene) 2.5 

115581 AI540842 Hs.61082 ESTs 6.1

115587 BE081342 Hs783037 HSPC039 protein 2.9 

115590 AA399477 Hs.67895 7-60 protein 5.3 

115646 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 4.7

115652 BE093589 Hs.38178 hypothetical protein FLJ23468 10.6

115655 AL048269 Hs.288544 Homo sapiens, clone MGC:16063, mRNA, complete cds 12.7

115663 AI138785 Hs.40507 ESTs 2.0

115676 AA953005 Hs.88143 ESTs 3.0 

115690 AA625132 Hs.44159 hypothetical protein FLJ21615 1.7

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type receptor 3, flamingo (Drosophila) homolog 6.8

115715 BE395161 Hs.1390 proteasome (prosome, macropain) subunit, beta type, 2 17

115734 AI950339 Hs.40782 ESTs 2.6 

115811 NM_015434 Hs.48604 DKFZP434B168 protein 2.1

115823 AI732742 Hs.87440 ESTs 2.1

115837 AI675217 Hs.42761 ESTs 1.3

115844 AI373062 Hs.332938 hypothetical protein MGC5370 4.4

115866 AW062629 Hs.52081 KIAA0867 protein 7.2

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 12 

115941 AI867451 Hs.46679 hypothetical protein FLJ20739 5.5

115968 AB037753 Hs.62767 KIAA1332 protein 9.8

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 1.4

116011 AU59053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 2.4

116108 AA770688 Hs.28777 H2A histone family, member L 1.8 

116134 BE243834 Hs.50441 CGI-04 protein 1.4

116189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypothetical protein KIAA0584 [H.sapiens] 1.2

116195 AW821113 Hs.72402 ESTs 2.1

116238 AV660717 Hs.47144 DKFZP586N0819 protein 17 

116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 17

116252 AI936442 Hs.59838 hypothetical protein FLJ10808 17

116298 AI955411 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PLACE1011133 1.9

116318 AF097645 Hs5e570 deleted in cancer 1; RNA helicase HDB/DICE1 4.9

116325 AI472106 Hs.49303 Homo sapiens cDNA FLJ11663 fis, clone HEMBA1004631 1.4

116336 AL133033 Hs.4084 KIAA1025 protein 1.9

116339 AK000290 Hs.44033 dipeptidyl peptidase 8 1.5

116350 AA497129 Hs.lM771 nuclear factor I/C (CCAAT-binding transcription factor) 1.9

116358 AI149586 Hs.38125 interferon-induced protein 75, 52kD 1.9

116365 N50174 Hs.46765 ESTs 6.1 

116368 N90466 Hs.71109 KIAA1229 protein 1.6

116417 AW499664 Hs.12484 Human clone 23826 mRNA sequence 7.4

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 2.1

116462 AF218313 Hs.236828 putative helicase RUVBL 1.5

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 1.2 

116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 (p85 alpha) 1.5

116637 AK001043 Hs.92033 integrin-linked kinase-associated serine/threonine phosphatase 2C Zl

116640 X89984 Hs.211563 B-cell CLL/lymphoma 7A 2.3
116700
116705 
116732 
116921 
116926 
117034 
117132 
117247 
117276 
117284 
117367 
117368 
117382 
117412 
117557 
117588 
117745 
117754 
117879 
117881 
117904 
117911 
117933 
117983 
118078 
118301 
118429 
118472 
118488 
118509 
118528 
118656 
118670 



AI800202 Hs.317589 hypothetical protein MGC10765
AW074819 Hs.12313 hypothetical protein FLJ14566

AW152225 Hs.16590g ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens]
AW068115 Hs.821 biglycan
H73608 Hs.290830 ESTs 

YY1-associated factor 2
plO-binding protein 



Hs.180324 
Hs.42315 

Hs.121806 



U72209 
AI393666 
N21032 
N71183 

AK001701 Hs.183779 

AI041793 Hs.42502 

AI878942 Hs.90336 

AF150275 Hs.40173 

N32536 Hs.42645 

AF123050 Hs.44532 

N34895 Hs.4464d 

BE294925 Hs.46680 

AA121673 Hs.59757 

N54706 Hs.303025 

AF161470 Hs.260622 

BE540675 Hs.332938 

AL137379 Hs.47125

Y10518 Hs.116470

AL110246 Hs.47367

N54321 Hs.47790 

AA453902 Hs.293254 

AA243332 Hs.74649 

AL157545 Hs.42179



1.4 
3.4 
2.9 
8.3 
IJ 
3.4 
5.2 



118737 
118925 
118984 
118986 
119206 
119235 
119235 
119265 
119279 
119298 
119338 
119349 
119403 
119478 
119486 
119513 
119601 
119602 
119676 
119682 
119774 
119780 
119789 
119805 
119818 
119863 
119905 
119966 
120132 
120206 
120248 
120253 
120269 



gb:yx46{D6.s1 Soares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:264803 3', mRNA sequence 5.5

I Homo sapiens cDNA FLJ11971 fis, clone HEMBB1001208 1 5 

Homo sapiens cDNA FLJ10590 fis, clone NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0

ESTs 2.0 

ATPase, H+ transporting, lysosomal (vacuolar proton pump), member J 2.1

ESTs 2.7 

solute carrier family 16 (monocarboxylic acid transporters), member 6 1.4

diubiquitin 3.4 

ESTs 3.4 

CGM2 protein 3.0 

zinc finger protein 281 1.9

chromosome 11 open reading frame 24 1.8

butyrate-induced transcript 1 5.7

hypothetical protein MGC5370 5.9

hypothetical protein FLJ13912 1.7

hypothetical protein FLJ20048 1.7 

KIAA1785 protein 5.4 

EST 5.2 

ESTs 2.6 

cytochrome c oxidase subunit VIc 2.5

bromodomain and PHD finger containing, 3 4.1 

Hs.50102 rapa-2 (rapa gene) 1.2

Hs.43228 Homo sapiens cDNA FLJ11835 fis, clone HEMBA1006595 13

Hs.49397 ESTs 7.4 

Hs.293287 ESTs 2.5 

AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 [H.sapiens] 1.2

AB033113 Hs.50187 KIAA1287 protein 2.1

AA199686 gb:zq75g09.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647488 5' 5.2

N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4
AI668709 Hs.240722 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6



AJ277275 
N22617 
AI949952 
AI458020 



Hs.206832 
Hs.240722 
AF148713 Hs.125830 
W24781 Hs.293798
AW453069 Hs.3657
AW453069 Hs.3657
BE539706 Hs.285363 
N57568 Hs.48028 
NM_001241 Hs.155478
AI417240 Hs.320836 
T65004 Hs.163561 
AL117554 Hs.119908 



Hs.170042 
Hs.55513



AI624342 
AI796730 
W37933 

AK000155 Hs.91684 
AW675298 Hs.233694 
AA243837 Hs.57787 
W61019 Hs.57811 
AB032977 Hs.6298 
NM_016625 Hs.191381
BE393948 Hs.50915 
AJ223810 Hs.43213 
AA130970 Hs.58382 
AA081218 Hs.58608 
AW449064 Hs.119571 
AA703129 HS58963 
W57554 Hs.125019 
H26735 Hs.91668
AI924294 Hs.173259 
AA131376 Hs.326401 
AW131940 Hs.104030



bladder cancer overexpressed protein 4.8 

KIAA1710 protein 1.7 

activity-dependent neuroprotective protein 2.2 

activity-dependent neuroprotective prote 1.6

ESTs 1.4

EST 25.1 

cyclin T2 1.6

ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 1.3

ESTs 8.4

nucleolar protein NOP5/NOP58 6.7

ESTs 2.4 

ESTs 2.1 

Empirically selected from AFFX single probeset 1.9

Homo sapiens mRNA; cDNA DKFZp667I103 (from clone DKFZp667I103) 3.7

hypothetical protein FLJ11350 3.0 

ESTs 1.4 

ESTs 1.2 

KIAA1151 protein 1.8

hypothetical protein 3.1 

kallikrein 5 (KLK5; KLK-L2; stratum corneum tryptic enzyme) 9.2
ESTs, Weakly similar to IEFS_HUMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 3.6

hypothetical protein FLJ11101 2.5

Homo sapiens cDNA FLJ14205 fis, clone NT2RP3003157 2.7

collagen, type III, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) 2.6

ESTs 2.7 

lymphoid nuclear protein (LAF-4) mRNA 1.2

Homo sapiens clone PP1498 unknown mRNA 45.7 

uncharacterized bone marrow protein BM033 1.2

fibroblast growth factor 12B 38.9

ESTs 9.6 
120274 AA177051 gb:nc02a02.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone IMAGE:194 similar to contains Alu 4.6

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 3', mRNA sequence 20

120296 AW995911 Hs.299883 hypothetical protein FLJ23399 1.8

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 15.2

120324 AA195517 Hs.191643 ESTs 5.5

120325 AA195651 Hs.104106 ESTs 6.4 
120327 AK000292 Hs.278732 hypothetical protein FLJ20285 16.1 
120336 NB5785 Hs.181165 eukaryotic translation elongation factor 1 alpha 1 2.9
120342 AW450669 HS.4506B hypothetical protein DKFZp434I143 5.7
120345 AA210722 Hs.104158 ESTs 4.5 
120349 AW969481 Hs.55189 hypothetical protein 16.8 
120352 R06859 Hs.193172 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 5.0
120356 AF000545 Hs.296433 putative purinergic receptor 28.1
120371 AA219305 Hs.104196 EST 12 4 

120382 AA228026 Hs.38774 ESTs 4.0 

120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homolog 1 9.7
120386 AW969665 Hs.154848 hypothetical protein DKFZp434D0127 32.6

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21.7
120396 AA134006 Hs.79306 eukaryotic translation initiation factor 4E 125 
120404 AB023230 Hs.95427 KIAA1013 protein 7.2
120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (from clone DKFZp586F1323) 11.4
120423 AA235453 Hs.18978 Homo sapiens cDNA: FLJ22822 fis, clone KAIA3968 1.9

120472 AI950087 gbwqOScOZ-xl NCI_CGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence 19.4

120473 AA251973 Hs.269988 ESTs 5.4 
120484 AA253170 Hs.9e473 EST 10.4 
120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:682387 3', mRNA sequence. 3.9
120509 BE047718 Hs.9e545 ESTs 9.4
120520 AA258601 Hs.161731 EST 2.4 
120535 BE350244 Hs.96547 ESTs 2.5 
120551 AA279160 Hs.111407 Homo sapiens, clone IMAGE:3613029, mRNA, partial cds 5.2
120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4
120582 BE244830 Hs.284228 ZNF135-like protein 10.2 
120590 AW372799 Hs.125790 leucine-rich repeat-containing 2 2.1
120596 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 7.5
120619 AW965339 Hs.111471 ESTs 2.5
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0 
120639 AA286942 gb:zs56f05.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:701505 3' similar to contains Alu 2.4
120648 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA, partial cds 5.0
120653 AW063659 Hs.191649 ESTs 2.2 

120668 AW969638 Hs.112318 6.2 kd protein 2.2

120669 BE536739 Hs.109909 ESTs 1.9 

120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens cDNA clone 3' similar to contains PTR7.t1 PTR7 46.8

120696 AI821539 Hs.97249 ESTs 2.5 
120713 AW449855 Hs.96557 Homo sapiens cDNA FLJ12727 fis, clone NT2RP2000027 5.9
120718 AA292747 Hs.97296 ESTs 2.9 
120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor [H.sapiens] 7.0
120774 AI608909 Hs.193985 ESTs 7.8 
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 protein 6.8
120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing 0 family repeat. 4.4 
120938 AA386260 Hs.104632 EST 4.4 
120977 AA398155 Hs.97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

120985 AI219896 Hs.97592 ESTs 1.2 
121011 AA398360 Hs.97608 EST 3.1 
121026 AI439713 Hs.165295 ESTs 3.5 
121081 AA396721 Hs.186749 ESTs, Highly similar to 137550 mismatch repair protein MSH2 [H.sapiens] 5.4
121133 AA363307 Hs.97032 ESTs 3.7 
121176 AL121523 Hs.97774 ESTs 1.7 
121223 AI002110 Hs.97169 ESTs, Weakly similar to dJ667H12.2.1 [H.sapiens] 2.9
121320 AA403008 Hs.301927 c6.1A 1.9 
121340 AW956981 Hs.97910 Homo sapiens cDNA FLJ13383 fis, clone PLACE1001024 3.5
121408 AA406137 Hs.98019 EST 6.0 
121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 7.4
121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, complete cds 6.9
121452 AW971063 Hs.292882 ESTs 1.8 
121455 H5d306 Hs.15165 retinoic acid induced 14 10.5


121457 W07404 Hs.144502 hypothetical protein FLJ22055 34 

121496 AA442224 Hs.97900 ESTs 14.4 

121505 AA494172 Hs.194417 ESTs 13.1 

121508 AA402515 Hs.97887 ESTs 28.0 

121513 AA416653 Hs.181510 ESTs 6.2 

121514 AA412112 gb:zt69b02.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:727563 3', mRNA sequence. 2.6
121549 AA412477 Hs.98142 EST 7.4 
121558 AA412497 gb:zt95g12.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:730150 3' similar to contains 2.8
121577 AA411970 Hs.98096 EST 35 

121581 AA416568 gb:zu05c10.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 6.1

121589 AD001528 Hs.89718 spermine synthase 35

121594 AA626010 Hs.98247 ESTs 2.2 

121622 AA416931 Hs.126065 ESTs 4.2 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (from clone DKFZp434B1023) 7.8

121682 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PLACE1007743 2.0

121690 AV660305 Hs.110286 ESTs 4.7

121706 U55184 Hs.154145 hypothetical protein FLJ11585 12.7

121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ11953 fis, clone HEMBB1000883 8.1

121729 AI949597 Hs.98325 ESTs 1.8

121731 AA421041 Hs.180744 ESTs 4.0 

121744 AA398784 Hs.97514 ESTs 7.1 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 19.5

121773 AB033022 Hs.158654 KIAA1196 protein 7.9

121775 AA421773 Hs.161008 ESTs 1.7

121776 AA292579 Hs.125133 hypothetical protein FLJ22501 6.6
121785 AI810774 Hs.98376 ESTs 10.5
121832 AW340797 Hs.98434 ESTs 5.8 
121836 AA328348 Hs.218289 ESTs 3.8 
121839 AA425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein [H.sapiens] 5.0
121842 AF027406 Hs.104865 serine/threonine kinase 23 2.7 
121847 AA446628 Hs.2799 cartilage linking protein 1 2.3
121871 AW972668 Hs.293044 ESTs 2.9 
121882 AA426376 Hs.98459 ESTs 5.0 
121911 AA427950 gb:zw50ID2.s1 Soares_total_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:773499 3' 7.2
121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 2.5
121935 AA428647 Hs.98611 EST 2.3 
121983 AA298760 Hs.180191 hypothetical protein FLJ14904 3.4
121985 AI862570 Hs.299214 Homo sapiens, clone IMAGE:2822295, mRNA, partial cds 114
121995 AA210863 Hs.3532 nemo-like kinase 3.8
121999 AA430211 Hs38668 EST 6.4 
122009 AW292763 Hs.160822 Homo sapiens cDNA: FLJ20863 fis, clone ADKA01804 2.2
122013 AA431085 Hs.98706 ESTs 6.5
122036 W92142 Hs.271963 ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13.1
122050 AI453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 9.1
122060 AA431738 Hs.98750 EST 13.1 
122114 AW161023 Hs.104921 ESTs 1.5 
122188 AA398838 gb:zt50d01.r1 Soares_testis_NHT Homo sapiens cDNA clone, mRNA sequence 3.3
122204 AA435936 Hs.98842 EST 5.6 
122246 AA329550 Hs.29417 HCF-binding transcription factor Zhangfei 5.1 
122257 AA436819 Hs.98899 ESTs 5.6 
122302 AA441801 Hs.104947 ESTs 5.8 
122341 AW601969 Hs.99010 hypothetical protein FLJ22263 similar to nuclear receptor-binding SET-domain protein 1 2.0
122356 AA443794 Hs.98390 ESTs 7.3 
122369 AA443985 Hs.303222 ESTs 12.2 

122371 AA868555 Hs.178222 ESTs 5.0 

122372 AA446008 Hs.336677 EST 7.6 
122378 AB032948 Hs.21356 hypothetical protein DKFZp762K2015 2.5
122405 AA446572 Hs.303223 EST 2.8 
122412 AA446869 Hs.119316 ESTs 7.3
122415 AA446918 Hs.99088 EST 1.9 
122418 AA446966 Hs.99090 ESTs, Moderately similar to similar to KIAA0766[H.sapiens] 6.8 
122440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from clone DKFZp547C244) 2.6
122446 AA447603 Hs.99123 EST 1.8 
122448 AA447626 Hs.99127 EST 3.5 
122458 AI266159 Hs.104980 ESTs 1.5 
122460 AW418788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenorhabditis elegans [C.elegans] 97
122464 AA448158 Hs.99152 EST 4.8 
122490 AA448349 Hs.238151 EST 6.1

122492 AA448417 Hs.104990 ESTs 5.4

122502 AA204969 Hs.234863 Homo sapiens cDNA FLJ12082 fis, clone HEMBB1002492 1.3

122510 AA449232 Hs.99195 ESTs 11.2

122530 AW959741 Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 10.1

122547 AA779725 Hs.164589 ESTs 2.5

122555 AA194055 Hs.293858 ESTs 1.9

122570 AA452578 Hs.262907 ESTs 9.5 

122572 AA452601 Hs.99287 EST 11.0

122586 AK001910 Hs.99303 Homo sapiens cDNA FLJ11048 fis, clone PLACE1004516 3.4

122587 AB040893 Hs.6968 KIAA1460 protein 2.0 

122598 AI028173 Hs.99329 ESTs 17 

122599 AL355841 Hs.99330 hypothetical protein FLJ23588 4.4
122602 AA411925 Hs.301960 ESTs 4.6 
122607 AA453518 Hs.98023 ESTs 61.5 
122614 AA453630 Hs.99339 EST 107 

122616 AA453638 Hs.161873 ESTs 107.3 

122617 AI681535 Hs.148135 serine/threonine kinase 33 121.4

122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 31.1 
122622 AA453987 Hs.144802 ESTs 5.6 
122717 AA455859 Hs.178358 ESTs 8.5 
122762 AI376875 Hs.105119 ESTs 10.4 
122829 AW204530 Hs.99500 ESTs 81.8 
122834 AA461492 Hs.99545 Homo sapiens cDNA FLJ10658 fis, clone NT2RP2006052 3.6

122836 AA460581 Hs.290996 ESTs 4.5 

122837 AA461509 Hs.293565 ESTs, Weakly similar to putative p150 [H.sapiens] 2.7

122838 AA4605d4 Hs.334386 ESTs 75.3 
122854 AA600235 Hs.9625 NIMA (never in mitosis gene a)-related kinase 6 77
122856 AI929374 Hs.75367 Src-like-adapter 5.8
122861 AA335721 Hs.119394 ESTs 13
122866 BE539656 Hs.283705 ESTs 4.1 
122868 AF005216 Hs.115541 Janus kinase 2 (a protein tyrosine kinase) 5.3
122870 AW576312 Hs.318722 Homo sapiens cDNA: FLJ21766 fis, clone COLF7179 9.9
122872 AW081394 Hs.97103 ESTs 5.3 
122879 AA769410 Hs.128654 ESTs 13.9
122907 AA470074 Hs.169896 ESTs 11.5 
122916 AA470140 Hs.229170 EST 17 
122981 AA478951 Hs.105629 ESTs 5.0 
123013 AW968324 Hs.17384 ESTs 15.4
123016 AW338067 Hs.323231 Homo sapiens cDNA FLJ11946 fis, clone HEMBB1000709 2.8
123034 AL359571 Hs.44054 ninein (GSK3B interacting protein) 8.7
123072 AI382600 Hs.104308 ESTs, Weakly similar to KIAA1395 protein [H.sapiens] 8.8
123082 AA485360 Hs.105661 ESTs 3.9
123088 AI343652 Hs.105667 ESTs 3.8 
123110 AA485256 Hs.193510 EST 7.4 
123114 BE304942 Hs.265848 myomegalin 2.8

123131 T52027 Hs.271795 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 2.4

123132 A)061582 Hs.324179 Homo sapiens cDNA FLJ12371 fis, clone MAMMA1002434 15.6
123136 AW451999 Hs.194024 ESTs 5.1
123149 AI734179 Hs.105676 ESTs 23.8 
123152 AW601773 Hs.270259 ESTs 5.2 

123258 AA490929 Hs.105274 ESTs, Weakly similar to RMS1_HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 9.3

123315 AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3' similar to 4.1

123369 AA504757 Hs.105738 ESTs 6.9 

123394 AA731404 Hs.105510 ESTs 3.6 

123433 AW450922 Hs.112478 ESTs 3.7 

123466 AA599042 Hs.112503 EST 7.4 

123470 AW303285 Hs.303632 Human DNA sequence from clone RP11-110H4 on chromosome 5 Contains a pseudogene similar to 3.5

123471 AB021644 Hs.197219 zinc finger protein 14 (KOX 6) 5.2 
123475 BE439553 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, partial cds 1.7 
123482 N95059 Hs.55098 ESTs 1.6 
123486 BE019072 Hs.334802 Homo sapiens cDNA FLJ14680 fis, clone NT2RP2004242, weakly similar to 2.4
123508 AW380388 Hs.155546 KIAA1080 protein; Golgi-associated, gamma-adaptin ear containing, ARF-binding protein 2 2.2
123615 AA609170 gb:af12a12.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 7.8
123619 AA602964 gb:no97o023l NCI_CGAP_Pr2 Homo sapiens cDNA clone, mRNA sequence 2.8

123658 AA609364 gb:zu71d09.s1 Soares testis NHT Homo sapiens cDNA clone IMAGE:743441 3' similar to contains Alu. 17

123674 AI269609 Hs.105187 kinesin protein 9 gene 57
123735 NM_013241 Hs.95231 FH1/FH2 domain-containing protein m

123738 AA609891 Hs.112777 EST 5.2 

123753 AA609955 Hs.234961 Huntingtin Interacting protein E 30.6 

123804 AA620464 Hs.261915 EST, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 2.1

123811 AA620586 gb:ae60g05.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone IMAGE:951320 3' 2.7

123951 AB012922 Hs.173043 metastasis-associated 1-like 1 6.2

123983 AJ272267 Hs.146178 choline dehydrogenase 4.4

124001 L42542 Hs.75447 ralA binding protein 1 7.0 

124006 AI147155 Hs.270016 ESTs 8.1

124070 AI950314 Hs.154762 HIV-1 rev binding protein 2 3.7

124074 H05635 Hs^94030 topoisomerase-related function protein 4-2 1.2 

124178 BE463721 Hs.97101 putative G protein-coupled receptor 3.1 

124203 AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC093 [H.sapiens] 5.7

124352 AA640891 Hs.102406 ESTs 3.1 

124375 D87454 Hs.192966 KIAA0265 protein 3.5

124385 AI267647 gb:aq49a10.x1 Stanley Frontal NB pool 2 Homo sapiens cDNA clone similar to contains 57.1

124390 AA317338 Hs.7535 COBW-like protein 2.8

124391 AF155099 Hs.279780 NY-REN-18 antigen 7.1 

124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3' similar to contains Alu 3.3

124428 H13540 Hs.82202 ribosomal protein L17 2.9 

124440 AA532519 Hs.129043 Human DNA sequence from clone 989H11 on chromosome 22q13.1-13.2. Contains part of a 7.8

124466 R10084 Hs.113319 kinesin heavy chain member 2 2.6

124482 N53935 gb:yv59d09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3', mRNA sequence 7.9

124498 H79433 Hs.268997 ESTs 7.8 

124515 AA669097 Hs.109370 ESTs 3.3 

124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 4^

124631 NM_014053 Hs.270594 FLVCR protein 3.2

124634 AI765123 Hs.143671 Homo sapiens cDNA FLJ13533 fis, clone PLACE1006371 5.6

124637 AA160474 Hs.75798 hypothetical protein 9.3 

124642 AW968856 Hs.278569 sorting nexin 17 3.5

124649 N92593 Hs.313054 ESTs 6.1 

124656 AW297702 Hs.102915 ESTs 8.3 

124661 R48170 Hs.78436 EphB1 5.6

124683 AA381661 Hs.119878 ESTs, Weakly similar to M3K9_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE 7.9

124712 R09166 Hs.191148 ESTs 5.7 

124735 R22952 Hs.268685 ESTs 11.3 

124761 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, partial cds 9.0

124768 AW368528 Hs.100855 ESTs 8.1

124775 R41772 Hs.100878 ESTs 4.9

124777 R41933 Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 2.8

124788 R43543 Hs.100912 Homo sapiens cDNA: FLJ22726 fis, clone HSI15005 5.1

124809 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full insert 4.2

124811 R46068 Hs.288912 hypothetical protein FLJ22604 14.2

124812 R47948 Hs.188732 ESTs 7.9 
124822 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PLACE1007743 6.6
124825 AA501669 Hs.335693 ESTs 2.3 
124833 AW975868 Hs.294100 ESTs 2.7
124857 R63652 Hs.137190 ESTs 2.3
124860 R65763 Hs.101477 EST 23.9 
124863 AI382555 Hs.127950 bromodomain-containing 1 2.0
124876 AF135422 Hs.27059 GDP-mannose pyrophosphorylase A 4.4 
124878 BE397530 Hs.288057 hypothetical protein FLJ22242 2.7 

124902 H37941 Hs.101883 ESTs 5.7 

124903 AW296713 Hs.221441 ESTs 32.4 
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!! [H.sapiens] 22.8
124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein [H.sapiens] 6.1
124958 AI078645 Hs.431 murine leukemia viral (bmi-1) oncogene homolog 1.9
124980 T40841 HsJ8681 ESTs 4.5 
125002 T59338 Hs.269463 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 4.9
125047 T79815 Hs.279793 ESTs 5.0 
125051 T79956 Hs.100588 EST 135.3
125056 T81310 Hs.100592 ESTs 5.4 
125101 AI472068 Hs.286236 KIAA1856 protein 5.6
125113 T96595 Hs.302270 ESTs, Weakly similar to ALUF_HUMAN !!!! ALU CLASS F WARNING ENTRY !!! [H.sapiens] 1.8

125115 T97341 gb:ye57e05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121856 3' similar to 9.6

125125 AI222382 Hs.240767 Human DNA sequence from clone RP1-12614 on chromosome 6q24.1-25.2. Contains the 5' end of the gene 1.5

125147 W38150 Empirically selected from AFFX single probeset 1.7
125161 W44657 Hs.144232 EST 10.7

125249 AA830863 Hs.131375 ESTs, Moderately similar to ALUB_HUMAN !!!! ALU CLASS B WARNING 1.3

125255 AF098162 Hs.118631 timeless (Drosophila) homolog 9.4

125279 AW401809 Hs.4779 KIAA1150 protein 1.5

125280 AI123705 Hs.106932 ESTs 8.0

125298 AW972542 Hs.289008 Homo sapiens cDNA: FLJ21814 fis, clone HEPai068 1.5

125660 AW292171 Hs.23978 scaffold attachment factor B 5.9 

125827 NM_003403 Hs.97496 YY1 transcription factor 1.2

125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 6.4

126005 AW409701 Hs.1578 baculoviral IAP repeat-containing 5 (survivin) 14.3

126202 AA157632 Hs.272630 vacuolar proton pump delta polypeptide 2.4

126695 AA643322 Hs.172028 a disintegrin and metalloproteinase domain 10 9.1

127050 AW411066 Hs.274351 CGI^9 protein 17.0 

127274 AW966158 HS58582 Homo sapiens cDNA FLJ12789 fis, clone NT2RP2001947 12.8

128355 AW293012 Hs.161623 ESTs 7.3

128493 D87466 Hs.240112 KIAA0276 protein 3.1 

128493 D87466 Hs.240112 KIAA0276 protein 1.3 

128522 BE173977 Hs.10098 putative nudeolar RNA helicase 9.4 

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47) 1.5

128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR14 [H.sapiens] 2.8

128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member 12.1 

128599 NM_015366 Hs.102336 Rho GTPase activating protein 8 2.3

128604 AI879099 Hs.102397 GIOT-3 for gonadotropin inducible transcription repressor-3 1.3

128608 BE267994 Hs.102419 zinc finger protein 7.1 

128625 AB037841 Hs.102652 hypothetical protein ASH1 1.3

128629 AL096748 Hs.102708 DKFZP434A043 protein 3.2 

128639 AW582962 Hs.102897 CGI-47 protein 2.0

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon 1.4

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon 1.3

128656 BE397354 Hs.324d30 diptheria toxin resistance protein required for diphthamide biosynthesis (Saccha 2.4

128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cDNA 1700010L19 gene, clone MGC:16214, mRNA, complete cds 7.1

128691 W27939 Hs.103834 hypothetical protein MGC5576 7.7

128696 BE081143 Hs.225977 nuclear receptor coactivator 3 3.8

128700 Y15221 Hs.103982 small inducible cytokine subfamily B (Cys-X-Cys), member 11 1.6

128714 T85231 Hs.179661 tubulin, beta 5 7.6

128717 AK001564 Hs.104222 hypothetical protein FLJ10702 5.5

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens] 2.7

128737 AF292100 Hs.104613 RP42 homolog 2.8

128742 AA307211 Hs.251531 proteasome (prosome, macropain) subunit alpha type, 4 4.4

128746 AI470163 Hs.323342 actin related protein 2/3 complex, subunit 4 (20 kD) 2.2

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase 2.8

128772 BE302796 Hs.105097 thymidine kinase 1, soluble 5.3 

128761 N71826 Hs.105465 small nuclear ribonucleoprotein polypeptide F 53.9

128797 NM_002975 Hs.105927 stem cell growth factor, lymphocyte secreted C-type lectin 13.3

128806 AW630942 Hs.106061 RD RNA-binding protein 2.6

128814 AW248431 Hs.256526 nuclear prelamin A recognition factor 2.2

128830 BE281170 Hs.106357 valosin-containing protein 5.9 

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 1.6

128854 BE159181 Hs.168232 hypothetical protein FLJ13855 2.2

128854 BE159181 Hs.168232 hypothetical protein FLJ13855 1.9

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 3.0 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 2.2 

128871 AF189723 Hs.106778 ATPase, Ca++ transporting, type 2C, member 1 1.5 

128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, complete cds 13.3

128906 R57988 Hs.10706 epithelial protein lost in neoplasm beta 4.7

128920 AA622037 Hs.166468 programmed cell death 5 1.4 

128925 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 1.9

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3-hydroxylase) 72

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) (ADAM-12) 2.4

128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 1.3

128959 AI580127 Hs.107381 hypothetical protein FLJ11200 10.9

128965 AW150697 Hs.107418 ESTs 1.4 

128970 AI375672 Hs.165028 ESTs 1.3

128975 BE560779 Hs.284233 NICE-5 protein 14.0

128979 AW271217 Hs.281434 Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838 1.6

128995 AI816224 Hs.107747 DKFZP566C243 protein 1.9 

129019 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence 2.9
129021 AL044675 Hs.173081 KIAA0530 protein 3.8

129021 AL044675 Hs.173081 KIAA0530 protein 2.5

129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 3.4

129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypothetical protein DKFZp434M2023.1 [H.sapiens] 5.0

129078 AI351010 Hs.102267 lysosomal 2.1

129088 AA744610 Hs.194431 palladin "l^.l

129095 L12350 Hs.108623 thrombospondin 2 2J 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 3.0 
129099 AF146074 Hs.108660 ATP-binding cassette, sub-family C (CFTR/MRP), member 5 5.8

129136 W93048 Hs.250723 hypothetical protein MGC2747 5.9 

129149 AA356620 Hs.108947 KIAA0050 gene product 6.3 

129172 AW162916 Hs.241576 hypothetical protein PRO2577 1.8

129192 AA286914 Hs.183299 ESTs 2.1

129194 AA150797 Hs.109276 latexin protein 3.2

129198 N57532 Hs.109315 KIAA1415 protein 5.8 

129207 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mimecan) 8.0

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 2.9

129229 AF013758 Hs.109643 polyadenylate binding protein-interacting protein 1 3.2 
129254 AA252468 Hs.1098 DKFZp434J1813 protein 2.6

129255 AI961727 Hs.109804 H1 histone family, member X 7.3

129288 W26392 Hs.110080 ESTs, Weakly similar to S13495 pregnancy zone protein [H.sapiens] 9.6

129296 AI051967 Hs.110122 ESTs 1.2

129323 AA287239 Hs.5518 Homo sapiens cDNA FLJ11311 fis, clone PLACE1010102 5.1

129340 H75334 Hs.11050 F-box only protein 9 4.6

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes 7.6

129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassium/chloride transporters), member 2 6.7

129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 8.6

129370 AI686379 Hs.110796 SAR1 protein J-J

129372 NM_016039 Hs.110803 CGI-99 protein 2.0

129403 AF149785 Hs.111126 pituitary tumor-transforming 1 interacting protein 7.4

129404 AI267700 Hs.317584 ESTs 5.0 
129404 AI267700 Hs.317584 ESTs 2.5
129423 AA204686 Hs.234149 hypothetical protein FLJ20647 10.2

129449 AI096988 Hs.111554 ADP-ribosylation factor-like 7 8.0

129453 AW974265 Hs.111632 Lsm3 protein 3.2 

129482 AA188185 Hs.289043 spindlin 6.7 

129482 AA188185 Hs.289043 spindlin 3.6

129513 AW843633 Hs.306163 hypothetical protein AL110115 7.1

129515 AF255303 Hs.112227 membrane-associated nucleic acid binding protein 2.5

129527 AA769221 Hs.270847 delta-tubulin 3.2

129559 W01296 Hs.11360 hypothetical protein FLJ14784 7.5

129560 AA317841 Hs.7845 hypothetical protein MGC2752 6.8 
129570 AI923097 Hs.11441 chromosome 1 open reading frame 8 2.0

129575 F08282 Hs.27d428 progestin induced protein ^-^

129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6.8

129588 BE408300 Hs.301862 postmeiotic segregation increased 2-like 9 1.4
129591 N57423 Hs.179898 HSPC055 protein 7.3
129594 AW403724 Hs.36989 coagulation factor VII (serum prothrombin conversion accelerator) 9.0 

129596 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic subunit of DNA polymerase zeta 1.6
129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) 2.2

129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 1.4

129629 AK000398 Hs.11747 hypothetical protein FLJ20391 3.8
129649 AD000092 Hs.16488 calreticulin 3^ 

129675 NM_015556 Hs.172180 KIAA0440 protein 13.4
129680 U03749 gb:Human chromogranin A (CHGA) gene, promoter an 14.1

129689 AW748482 Hs.77873 B7 homolog 3 2.6

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 7.4
129720 AA156214 Hs.12152 APMCF1 protein 2.0 

129721 NM_001415 Hs.211539 eukaryotic translation initiation factor 2, subunit 3 (gamma, 52kD) 1.7
129726 H15474 Hs.132896 fatty acid desaturase 1 6.3

129778 AK001676 Hs.12457 hypothetical protein FLJ10814 1.6

129779 AA394090 Hs.12460 Homo sapiens clone 23870 mRNA sequence 5.4
129800 AF052112 Hs.12540 lysosomal 1.7

129806 AB023148 Hs.173373 KIAA0931 protein 1.2
129815 BE565817 Hs.26498 hypothetical protein FLJ21657 3.1
129840 NM_006590 Hs.12820 SnRNP assembly defective 1 homolog 1.6
129861 AL049999 Hs.85963 DKFZP564M182 protein 2.2

129864 AI393237 Hs.129914 runt-related transcription factor 1 (acute myeloid leukemia 1.7

129869 AI222069 Hs.13015 hypothetical protein similar to mouse Dnajl1 2.7

129922 AF042379 Hs.13386 gamma-tubulin complex protein 2 4.5

129945 BE514376 Hs.165998 PAI-1 mRNA-binding protein 1.8

129953 AA412195 Hs.13740 ESTs 2.5 

129972 AW753185 Hs.180628 dynamin 1-like 1.8

129983 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 1.3

129989 AB015856 Hs.247433 activating transcription factor 6 4.0 

130010 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 1.6

130081 AA287325 Hs.14713 ESTs 4.0 

130082 S73265 Hs.1473 gastrin-releasing peptide 1.8 
130097 AL046962 Hs.14845 forkhead box O3A 2.8
130100 AL135561 Hs.14891 hypothetical protein FLJ21047 2.3

130111 X53002 Hs.149846 integrin, beta 5 2.3

130112 AA916785 Hs.180610 splicing factor proline/glutamine rich (polypyrimidine tract-binding protein 3.0

130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( 2.1

130128 L76937 Hs.150477 Werner syndrome 18

130135 AA311426 Hs.21635 tubulin, gamma 1 6.1 

130211 NM_003358 Hs.23703 ESTs, Moderately similar to CEGT_HUMAN CERAMIDE GLUCOSYLTRANSFERASE [H.sapiens] 1.6

130212 D80001 Hs.152629 KIAA0179 protein 1.3 

130236 R85367 Hs.51957 splicing factor, arginine/serine-rich 2, interacting protein 2.0 

130241 AL035588 Hs.153203 MyoD family inhibitor 3.2 

130242 X79201 Hs.153221 synovial sarcoma, translocated to X chromosome 5.4
130249 D81983 Hs.322852 GAS2-related on chromosome 22 4.8

130263 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related kinase 2 1.4

130287 AA479005 Hs.154036 tumor suppressing subtransferable candidate 3 2.6 

130310 AB011121 Hs.154248 amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 3 6.3

130353 Z19084 Hs.172210 MUF1 protein 6.2 

130356 AF127577 Hs.155017 nuclear receptor interacting protein 1 2.4

130357 AJ224442 Hs.155020 putative methyltransferase 3.4

130359 NM_013449 Hs.277401 bromodomain adjacent to zinc finger domain, 2A 8.5

130367 AL135301 Hs.8768 hypothetical protein FLJ10849 1.4

130372 AI077464 Hs.5011 RNA binding motif protein 9 3.3 

130393 N89487 Hs.155291 KIAA0005 gene product 1.8

130399 AW374106 Hs.155356 hypothetical protein MGC2840 similar to a putative glucosyltransferase 3.4

130407 BE385099 Hs.334727 hypothetical protein MGC3017 2.3

130409 NM_001197 Hs.155419 BCL2-interacting killer (apoptosis-inducing) 2.7

130419 AF037448 Hs.155489 NS1-associated protein 1 1.8

130441 U63630 Hs.155637 protein kinase, DNA-activated, catalytic polypeptide 2.3

130448 BE513202 Hs.15589 PPAR binding protein 3.9 

130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acetyltransferase) 33.6

130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 4.6

130471 AL121438 Hs.183706 adducin 1 (alpha) 2.7

130485 BE245651 Hs.180779 H2B histone family, member B 5.0

130487 U49844 Hs.77613 ataxia telangiectasia and Rad3 related 4.3

130498 L38951 Hs.180446 karyopherin (importin) beta 1 16

130503 BE208491 Hs.295112 KIAA0618 gene product 16.1

130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pseudoachondroplasia, epiphyseal dysplasia 1, 6.1

130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 5.3

130525 AW876523 Hs.15929 hypothetical protein FLJ12910 2.1

130542 U64675 Hs.179825 RAN binding protein 2-like 1 7.8

130544 AA321238 Hs.4310 eukaryotic translation initiation factor 1A 1.5

130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4

130556 AI907018 Hs.15977 Empirically selected from AFFX single probeset 4.7

130567 AA383092 Hs.1608 replication protein A3 (14kD) 7.9 

130568 AA232119 Hs.16085 putative G-protein coupled receptor 3.3
130574 AF083208 Hs.16178 apoptosis antagonizing transcription factor 1.2 
130586 AB007891 Hs.16349 KIAA0431 protein 5.6 

130598 AL042210 Hs.16493 hypothetical protein DKFZp762N2316; KIAA1803 protein 1.4

130601 AA609738 Hs.16525 ESTs 1.5

130614 AI354355 Hs.16697 down-regulator of transcription 1, TBP-binding (negative cofactor 1.3

130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transaminase 1 12.1

130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 2.4

130618 AA383439 Hs.16758 Spir-1 protein 15.9

130667 BE246961 Hs.17639 Homo sapiens ubiquitin protein ligase (UBE3B) mRNA, partial cds 13.9

130674 AL048842 Hs.194019 attractin 1.5 
130675 AA442233 Hs.17731 hypothetical protein FLJ12892 5.4

130692 AA6525ai Hs.13561 hypothetical protein MGC4692 5.0 

130893 R68537 Hs.17962 ESTs 2.0

130712 AJ271881 Hs.279762 bromodomain-containing 7 1^ 

130714 AI348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 2.0

130730 AB007920 Hs.18586 KIAA0451 gene product 3.7

130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevisiae) homolog 3.1

130751 AF052105 Hs.18879 chromosome 12 open reading frame 1.4 

130757 AL036067 Hs.18925 protein x 0001 5.7 

130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1), member 1 5.1

130789 AK000355 Hs.8899 sirtuin (silent mating type information regulation 2, S.cerevisiae, homolog 5.2

130815 AB018298 Hs.19822 SEC24 (S.cerevisiae) related gene family, member D 1.5

130836 J05068 Hs.2012 transcobalamin I (vitamin B12 binding protein, R binder family) 15.7

130841 AL157468 Hs.325825 Homo sapiens cDNA FLJ20848 fis, clone ADKA01732 2.8

130843 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 protein x 013 [H.sapiens] 15

130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.4

130855 AJ243706 Hs.143323 putative DNA/chromatin binding motif 1.7

130861 NM_016578 Hs.20509 HBV pX associated protein-8 1.9

130879 NM_003416 Hs.2076 zinc finger protein 7 (KOX 4, clone HF.16) 1.4

130880 BE514434 Hs.20830 kinesin-like 2 2.1

130892 AL120837 Hs.20993 high-glucose-regulated protein 8 2.4 

130898 AB033078 Hs.186613 sphingosine-1-phosphate lyase 1 1.7

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, member 2 1.8

130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 2.3

130944 BE382657 Hs.21486 signal transducer and activator of transcription 1, 91kD 5.4

1X971 N39842 Hs.301444 KIAA1673 2.2 

130992 BE398091 Hs.74316 desmoplakin (DPI, DPII) 1.8

130993 T97401 Hs.21929 ESTs 1.6 
131005 AV668308 Hs.2210 thyroid hormone receptor interactor 3 1.6

131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma 12

131042 AI626288 Hs.171637 hypothetical protein MGC2628 15 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cys-X-Cys), member 10 7.4

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 3.0

131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 17
131060 AA194422 Hs.22564 myosin VI 5.1

131060 AA194422 Hs.22564 myosin VI 2.5 

131070 N53344 Hs.22607 ESTs 7.1 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosamine) N-acetylglucosaminephosphotransferase 1 ( 2.0

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosam 19

131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxidase assembly protein 7.0

131174 NM_006540 Hs.29131 nuclear receptor coactivator 2 19

131185 BE280074 Hs.23960 cyclin B1 5.8

131206 AW138839 Hs.24210 ESTs 2.0 

131213 AA885699 Hs.24332 CGI-26 protein 7.0

131225 H62087 Hs.31659 thyroid hormone receptor-associated protein, 95-kD subunit 7.5

131231 N47468 Hs.59757 zinc finger protein 281 2.9 

131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 3 3.5

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2.8 

131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8

131247 AL043100 Hs.326190 fatty acid amide hydrolase 5.6

131281 AA251716 Hs.25227 ESTs 5.7

131283 X80038 Hs.339713 Homo sapiens clone F19374 APOE-C2 gene cluster 13

131305 AV656017 Hs.184325 CGI-76 protein 5.0

131320 AA505691 Hs.145696 splicing factor (CC1.3) 18

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6

131339 AF058695 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6 

131375 AW293165 Hs.143134 ESTs 5.4 

131390 BE269388 Hs.182698 mitochondrial ribosomal protein L20 5.3

131410 BE259110 Hs.279836 HSPC166 protein 2.2

131412 NM_012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE : Human selenium donor protein 2.0

131429 AL046302 Hs.26750 hypothetical protein FLJ21908 14

131458 BE297567 Hs57047 hypothetical protein FLJ20392 17

131475 AA992841 Hs.27263 KIAA1458 protein 2.0

131501 AV661958 Hs.8207 GK001 protein 2.6 

131501 AV661958 Hs.8207 GK001 protein 16

131511 AA732153 Hs.27865 Homo sapiens cDNA: FLJ21333 fis, clone COL02535 2.0

131528 AU076408 Hs.26309 UDP-glucose dehydrogenase 16 
131532 BE268278 Hs.26393 hypothetical protein MGC2592 7.4

131543 AW966881 Hs.41639 programmed cell death 2 2.2 

131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 2.1 
131562 NM_003512 Hs.28777 H2A histone family, member L 1.7
131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PLACE1004405 5.1
131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 1.8
131569 AL389951 Hs.271623 nucleoporin 50kD 5.0
131618 BE393822 Hs.29645 Homo sapiens mRNA; cDNA DKFZp761C029 (from clone DKFZp761C029); partial cds 1.8

131622 R78195 Hs.29692 Homo sapiens cDNA FLJ11436 fis, clone HEMBA1001213 1.3

131623 AB037791 Hs.29716 hypothetical protein FLJ10980 2.2
131623 AB037791 Hs.29716 hypothetical protein FLJ10980 1.9
131643 AW410601 Hs.30026 HSPC182 protein 2.9
131653 AW960597 Hs.30164 ESTs 1.3
131656 At2189ie Hs.30209 KIAA0854 protein 2.8 
131669 X52486 Hs.3041 uracil-DNA glycosylase 2 2.8
131692 BE559681 Hs.30736 KIAA0124 protein 5.6
131714 AA642831 Hs.31016 putative DNA binding protein 2.9 
131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransferase 3.4
131737 AK001641 Hs.31323 inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein 3.8
131760 X76732 Hs.3164 nucleobindin 2 2.9 
131760 X76732 Hs.3164 nucleobindin 2 2.8 
131763 AI878932 Hs.317 topoisomerase (DNA) I 3.4 
131772 AA382590 Hs.170980 KIAA0948 protein 25.5 
131774 BE267158 Hs.169474 DKFZP586J0119 protein 5.5
131787 D87077 Hs.196275 KIAA0240 protein 2.4 
131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT2RP2002439 7.9
131795 BE501849 Hs.32317 high-mobility group 20B 1.4
131798 X86098 Hs.301449 adenovirus 5 E1A binding protein 4.1
131817 U20536 Hs.3280 caspase 6, apoptosis-related cysteine protease 4.2 
131824 U28838 Hs.32935 TATA box binding protein (TBP)-associated factor, polymerase III, GTF3B subunit 3.5
131850 AI251317 Hs.33184 ESTs 5.1 
131878 AA083764 Hs.6101 hypothetical protein MGC3178 5.8
131885 BE502341 Hs.3402 ESTs 13.7 
131885 BE502341 Hs.3402 ESTs 2.4 
131887 W17064 Hs.332848 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, complete cds 8.7
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 2.0

131904 AF078866 Hs.284296 Homo sapiens cDNA: FLJ22993 fis, clone KAT11914 5.5

131905 AA179298 Hs.3439 stomatin-like 2 11.3
131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Drosophila; lipid desaturase) 1.7
131916 AA025976 Hs.34569 ESTs 5.2 
131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (yeast APC11 homolog) 2.7 
131929 BK41211 Hs.34804 Homo sapiens cDNA FLJ11472 fis, clone HEMBA1001711 5.3
131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3
131950 AA355113 Hs.35360 xOOl protein 15 
131962 AK000046 Hs.267448 hypothetical protein FLJ20039 2.3
131965 W79283 Hs.35962 ESTs 1.4
131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5 
131977 U90441 Hs.3622 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide II 6.5
131985 AA5a3020 Hs.36663 hypothetical protein FLJ22418 2.4
131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 2.1 
132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA. partial cds 3.2 
132031 AF193844 Hs.3758 C0P9 complex subunit 7a ' 5.8 
132062 BE266155 Hs.3832 clathnn^ssodated protein AP47 1.5 
132084 NIVL002267HS.3886 karyopherin alpha 3 (importin alpha 4) 3.7 
132103 BE171921 Hs.3991 ESTs 1.4 
132105 AV646076 Hs.39959 ESTs 5.8 
132116 AW960474 Hs.40289 ESTs 1.7 
132176 AA857025 Hs.8878 kinesln-like 1 3.3 
132180 NM.004460Hs.418 fibroblastactivatlon protein, alpha 14.7 
132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37 5.5 
132194 R42432 Hs,4212 ESTs 4.4 
132203 NM004782HS.194714 synaptosomal-assodated protein. 29kO 2.2 
132207 BE206939 Hs.42287 E2F transcription factor 6 2.2 
132235 AV658411 Hs.42656 KIAA1 681 protein 7.8 
132240 AB018324 Hs.42676 KIAA0781 protein 1.5 
132252 A1566004 Hs.141269 Homo sapiens cDNA: FU21550fts, clone COL06258 1-3 
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132266 AA301228 Hs.43299 hypothetical protein RJ 12890 5.7 

132273 AA227710 Hs.43658 DKFZP586L151 protein 4^
132276 AA553507 Hs.285711 hypothetical protein FLJ13089 2.1
132288 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 1.5
132294 AB023191 Hs.44131 KIAA0974 protein 10.0
132298 NM_015986 Hs.7120 cytokine receptor-like molecule 9 1.9
132299 AW405882 Hs.44205 cortistatin 9.2
132325 N37065 Hs.44856 hypothetical protein FLJ12116 2.0
132348 AW06770d Hs.170311 heterogeneous nuclear ribonucleoprotein D-like 6.5
132370 AW572805 Hs.46645 ESTs 3.8
132374 AF155682 Hs.46744 core1 UDP-galactose:N-acetylgalactosamine-alpha-R beta 1,3-galactosyltr 1.5
132376 AI279892 Hs.46801 sorting nexin 14 12.5
132384 AA312135 Hs.46967 HSPC034 protein 28.3
132393 AL135094 Hs.47334 hypothetical protein FLJ14495 19
132450 AA100012 Hs.48827 hypothetical protein FLJ12085 1.9
132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 6.1
132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 1.7
132465 AW169847 Hs.49169 KIAA1634 protein 8.6
132470 AI224456 Hs.4934 H.sapiens polyA site DNA 5.2
132484 X16560 Hs.119007 RAB4, member RAS oncogene family 1.4
132518 AW885606 Hs.5064 ESTs 6.1
132528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 3.3
132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiae)-like 1 2.0
132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 2.9
132534 BE388673 Hs.5066 hypothetical protein MGC10433 2.2
132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 7.3
132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cerevisiae, homolog of 1.7
132574 AW631437 Hs.5184 TH1 drosophila homolog 7.1
132596 AK001484 Hs.5298 CGI-45 protein 2.2
132611 AA345547 Hs.53263 hypothetical protein FLJ13287 2.2
132612 H12751 Hs.5327 PRO1914 protein 6.8
132616 BE262677 Hs.283558 hypothetical protein PRO1855 14.0
132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 11.4
132648 U51127 Hs.54434 hypothetical protein MGC1715 1.9
132668 AB018319 Hs.5460 KIAA0776 protein 2.6
132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 2.0
132715 F11875 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT2RP2005645 1.5
132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribonucleoprotein autoantigen SS-A/Ro) 3.0
132724 AI142265 Hs.55498 geranylgeranyl diphosphate synthase 1 2.4
132731 AI189075 Hs.301872 hypothetical protein MGC4840 12.4
132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 14.6
132760 AA125985 Hs.56145 thymosin, beta, identified in neuroblastoma cells 2.7
132771 Y10275 Hs.56407 phosphoserine phosphatase 3.0
132773 AA459713 Hs.295901 KIAA0493 protein 2.3
132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 1.8
132798 AI026701 Hs.5716 KIAA0310 gene product 3.7
132807 U07418 Hs.57301 mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) 1.8
132810 AB007944 Hs.5737 KIAA0475 gene product 5.9
132813 BE313625 Hs.57435 solute carrier family 11 (proton-coupled divalent metal ion transporters), member 2 8.7
132815 AI815189 Hs.57475 sex comb on midleg homolog 1 6.4
132817 N27852 Hs.57553 tousled-like kinase 2 3.6
132821 AJ251595 Hs.169610 CD44 antigen (homing function and Indian blood group system) 2.8
132833 U78525 Hs.57783 eukaryotic translation initiation factor 3, subunit 9 (eta, 116kD) 14.6
132842 NM_016154 Hs.279771 Homo sapiens clone PP1596 unknown mRNA 1.6
132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2.5
132851 U09716 Hs.287912 lectin, mannose-binding, 1 1.4
132663 BE268048 Hs.236494 RAB10, member RAS oncogene family 4.2
132869 AW963217 Hs.203961 ESTs, Moderately similar to AF116721_89 PRO2168 [H.sapiens] 2.8
132873 AW007683 Hs.58598 KIAA1266 protein 2.0
132875 NM_004850 Hs.58617 Rho-associated, coiled-coil containing protein kinase 2 16
132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (non-standard symbol) 1.4
132897 AW503667 Hs.59545 ring finger protein 15 5.4
132902 AI936442 Hs.59838 hypothetical protein FLJ10808 6.1
132912 AW732760 Hs.167578 Homo sapiens cDNA FLJ11095 fis, clone PLACE1005374 7.1
132913 W78714 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PLACE1009921 2.8
132940 T79136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds 6.1
132941 AI817165 Hs.6120 hypothetical protein FLJ13222 10.3
132942 AA554458 Hs.197751 KIAA0666 protein 1.8

132952 AI668580 Hs.61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 2.2
132962 AA576635 Hs.6153 CGI-48 protein 4.9
132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HEMBA1000575 2.7
132973 AA035446 Hs.323277 ESTs 5.3
132977 AA093322 Hs.301404 RNA binding motif protein 3 3.2
132980 AA040696 Hs.62016 ESTs 1.3
132994 M112748 Hs.279905 clone HQ0310 PRO0310p1 3.0
133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 10.3
133015 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc 2.1
133016 AI43968d Hs.6289 hypothetical protein FLJ20886 1.3
133053 AI065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, complete cds 6.0
133062 AW500374 Hs.64056 PRO0149 protein 5.3
133069 BE247441 Hs.6430 protein with polyglutamine repeat; calcium (ca2+) homeostasis endoplasmic reticulum protein 4.9
133091 AK001628 Hs.64691 KIAA0483 protein 3.5
133110 AA808177 Hs.65228 ESTs 13.1
133134 AF198620 Hs.65648 RNA binding motif protein 8A 1.3
133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds 2.2
133152 Z11695 Hs.324473 mitogen-activated protein kinase 1 1.3
133174 AA431620 Hs.324178 hypothetical protein MGC2745 17.1
133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-rich protein MP4 - mouse [M.musculus] 1.8
133177 X97795 Hs.66718 RAD54 (S.cerevisiae)-like 4.9
133197 AI275243 Hs.180201 hypothetical protein FLJ20671 3.1
133208 AI801777 Hs.6774 ESTs 4.4
133226 AW954569 Hs.296287 Homo sapiens, Similar to bromodomain-containing 4, clone IMAGE:3542455, mRNA, partial cds 1.7
133228 AI492924 Hs.6831 golgi phosphoprotein 1 6.0
133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1 1.5
133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, partial cds 1.4
133266 AI160873 Hs.69233 zinc finger protein 5.6
133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTEIN D2 [H.sapiens] 1.9
133285 M76477 Hs.289082 GM2 ganglioside activator protein 4.7
133291 BE297855 Hs.69855 NRAS-related gene 5.0
133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi 2.7
133321 T79526 Hs.179516 Integral type 1 protein 93
133327 AL390127 Hs.7104 Kruppel-like factor 13 4.4
133347 BE257758 Hs.71475 acid cluster protein 33 1.8
133360 AI016521 Hs.71816 v-akt murine thymoma viral oncogene homolog 1 5.5
133366 AA292811 Hs.72050 non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase) 2.7
133367 AF231919 Hs.18759 KIAA0539 gene product 1.7
133370 AF245505 Hs.72157 DKFZP564I1922 protein 1.8
133383 BE313555 Hs.7252 KIAA1224 protein 1.7
133390 AI950382 Hs.72660 phosphatidylserine receptor 1.3
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB alpha polypeptide) 16.1
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2
133437 AL031591 Hs.7370 phosphotidylinositol transfer protein, beta 10.4
133452 NM_002759 Hs.274382 protein kinase, interferon-inducible double stranded RNA dependent 1.2
133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7
133500 AW964fl04 Hs.74280 hypothetical protein FLJ22237 11.1
133529 W45623 Hs.74571 ADP-ribosylation factor 1 2.8
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subunit, non-ATPase, 2 2.9
133543 AU077073 Hs.108327 damage-specific DNA binding protein 1 (127kD) 2.5
133578 AU077050 Hs.75066 translin 1.5
133579 X75346 Hs.75074 mitogen-activated protein kinase-activated protein kinase 2 2.1
133582 BE391579 Hs.75087 Fas-activated serine/threonine kinase 1.3
133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cerevisiae PWP1 2.2
133595 AA393273 Hs.75133 transcription factor 6-like 1 (mitochondrial transcription factor 1-like) 1.5
133599 NM_002885 Hs.75151 RAP1, GTPase activating protein 1 5.7
133621 NM_004893 Hs.75258 H2A histone family, member Y 25.5
133627 NM_002047 Hs.75280 glycyl-tRNA synthetase 15.8
133631 NM_000401 Hs.75334 exostoses (multiple) 2 3.3
133649 U25849 Hs.75393 acid phosphatase 1, soluble 1.6
133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19 4.1
133720 L27841 Hs.75737 pericentriolar material 1 1.5
133722 AW969976 Hs.279009 matrix Gla protein 6.3
133751 AW402048 Hs.334787 Homo sapiens, Similar to likely ortholog of yeast ARV1, clone IMAGE:3506392, mRNA 3.9
133757 T52946 Hs.196209 RAE1 (RNA export 1, S.pombe) homolog 1.7
133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal protein SA) 1.8
133765 M62194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteoblast) 1.5
133780 AA557660 Hs.76152 decorin 3.5
133784 BE622743 Hs.301084 arfaptin 1 6.8
133791 M34338 Hs.76244 spermidine synthase 2.6
133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 1.4
133822 D50525 Hs.699 peptidylprolyl isomerase B (cyclophilin B) 8.0
133842 AW797466 Hs.285013 putative human HLA class II associated protein I 13.5
133845 AA147026 Hs.76704 ESTs 2.2
133850 W^2 Hs.7678 cellular retinoic acid-binding protein 1 1^
133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 2.0
133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 2.8
133867 AW340125 Hs.76989 KIAA0097 gene product 6.7
133868 AB012193 Hs.183874 cullin 4A 2.5
133861 U30872 Hs.77204 centromere protein F (350/400kD, mitosin) 3.0
133922 U30825 Hs.77608 splicing factor, arginine/serine-rich 9 1.4
133924 D86326 Hs.32594a vesicle docking protein p115 5.4
133929 NM_006306 Hs.211602 SMC1 (structural maintenance of chromosomes 1, yeast)-like 1 4.9
133936 L17128 Hs.77719 gamma-glutamyl carboxylase 37
133941 BE244332 Hs.77770 adaptor-related protein complex 3, mu 2 subunit 12.1
133959 X81789 Hs.77897 splicing factor 3a, subunit 3, 60kD 97
133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor gene activator) 3.1
133989 AL04032a Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 97
134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor 2.4
134015 D31764 Hs.278569 sorting nexin 17 2.6
134070 NM_003590 Hs.78946 cullin 3 1.3
134110 U41060 Hs.79136 LIV-1 protein, estrogen regulated 4.2
134129 NM_014742 Hs.79305 KIAA0255 gene product 2.2
134134 H86504 Hs.173328 protein phosphatase 2, regulatory subunit B (B56), epsilon isoform 5.0
134200 BK59S98 Hs.197803 KIAA0160 protein 3.2
134206 AF107463 Hs.79968 splicing factor 30, survival of motor neuron-related 2.5
134208 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 2.1
134219 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 9.1
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, partial cds 2.8
134275 AI878910 Hs.3688 cisplatin resistance-associated overexpressed protein 1.8
134292 AI906291 Hs.81234 immunoglobulin superfamily, member 3 2.0
134301 AW602505 Hs.81360 Homo sapiens cDNA: FLJ21927 fis, clone HEP04178, highly similar to HSU90909 2.5
134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2.8
134324 AB029023 Hs.179946 KIAA1100 protein 10.4
134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 1.9
134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 2.6
134337 NM_004922 Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 23
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, oncostatin M receptor) 13.0
134367 AA339449 Hs.82285 phosphoribosylglycinamide formyltransferase, phosphoribosylglycinamide synthe 8.8
134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 kD) 1.5
134379 AW362124 Hs.323193 hypothetical protein MGC3222 8.1
134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor differentially expressed 1, clone IMAGE:3639252, mRNA, partial
134391 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like repeat domains) 4.1
134395 AA456539 Hs.8262 lysosomal 17
134403 AA334551 Hs.82767 sperm specific antigen 2 2.6
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 1.3
134411 BE272095 Hs.167791 reticulocalbin 1, EF-hand calcium binding domain 3.2
134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1.9
134421 AU077196 Hs.82985 collagen, type V, alpha 2 103
134424 Z44190 Hs.83023 peroxisomal biogenesis factor 11B 2.4
134446 AA112036 Hs.83419 KIAA0252 protein 1.2
134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (p105) 1.6
134470 X54942 Hs.83758 CDC28 protein kinase 2 2.1
134480 NM_005000 Hs.83916 Empirically selected from AFFX single probeset 5.3
134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 2.5
134498 AW246273 Hs.84131 threonyl-tRNA synthetase 2.1
134513 AA425473 Hs.84429 KIAA0971 protein 3.8
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 2.4
134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofactor 4 57
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.3
134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 5.5
134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8
134612 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37 2.2
134624 AF035119 Hs.8700 deleted in liver cancer 1 2.0
134632 X78520 Hs.174139 chloride channel 3 2.3
134654 AK001741 Hs.8739 hypothetical protein FLJ10879 1.4
134664 AA256106 Hs.87507 ESTs 72.9
134666 BE391929 Hs.8752 transmembrane protein 4 8.5
134687 U62317 Hs.88251 arylsulfatase A 6.0
134692 NM_003474 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) 4.3
134705 BE161887 Hs.88799 anaphase-promoting complex subunit 10 2.3
134714 Y14768 Hs.890 lysosomal 6.7
134719 AA852985 Hs.89232 chromobox homolog 5 (Drosophila HP1 alpha) 2.3
134722 AF129536 Hs.284226 F-box only protein 6 2.9
134724 AF045239 Hs.321576 ring finger protein 22 6.6
134746 )a)7871 Hs.89476 CD2 antigen (p60), sheep red blood cell receptor 2.3
134751 AW830803 Hs.89497 lamin B1 6.2
134790 BE002798 Hs.287850 integral membrane protein 1 1.9
134806 AD001528 Hs.89718 spermine synthase 1.8
134834 AW451370 HS.&991 adaptor-related protein complex 1, gamma 2 subunit 1.4
134850 AI701162 Hs.90207 hypothetical protein MGC11138 1.4
134853 BE268326 Hs.90280 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase 5.6
134859 D26488 Hs.90315 KIAA0007 protein 2.8
134880 AI879195 Hs.90606 15 kDa selenoprotein 1.7
134910 AA532963 Hs.9100 Homo sapiens cDNA FLJ13100 fis, clone NT2RP3002255 1.7
134925 AW885909 Hs.6975 PRO1073 protein 2.1
134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform 1.3
134971 AI097346 Hs.286049 phosphoserine aminotransferase 2.1
134975 R50333 Hs.92186 Leman coiled-coil protein 2.3
135011 AB037835 Hs.92991 KIAA1414 protein 1.6
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial) 3.9
135032 AW301984 Hs.173685 hypothetical protein FLJ12619 6.2
135077 AW503733 Hs.9414 KIAA1488 protein 2.0
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase small subunit 2 homolog 1.3
135095 AF027219 Hs.9443 zinc finger protein 202 7.1
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 3.2
135153 AI093155 Hs.95420 JM27 protein 2.5
135181 BE250865 Hs.279529 px19-like protein 1.4
135199 AA477514 Hs.96247 translin-associated factor X 5.0
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATIVE C10 PROTEIN [H.sapiens] 6.1
135214 T78802 Hs.96560 hypothetical protein FLJ11656 4.6
135243 BE453721 Hs.97101 putative G protein-coupled receptor 5.6
135245 AI028767 Hs.262603 ESTs 3.5
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 1.2
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 2.6
135274 AA448460 Hs.112017 6E36gene 5 3
135294 AA150320 Hs.9800 protein kinase Njmu-R1 91
135295 AI090838 Hs.98006 ESTs 2.4
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein [H.sapiens] 13.3
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD homolog) 2.6
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE proteins) 8.3
135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HEMBA1003959 1.5
135389 U05237 Hs.99872 fetal Alzheimer antigen 4.9
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone receptor; testicular feminization; spinal and bulbar 2.0

134975 R50333 Hs.92186 Leman coiled-coil protein 2.6
135011 AB037835 Hs.92991 KIAA1414 protein 1.4
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mi 1.6
135032 AW301984 Hs.173685 hypothetical protein FLJ12619 1.4
135077 AW503733 Hs.9414 KIAA1488 protein 1.8
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase s 2.5
135095 AF027219 Hs.9443 zinc finger protein 202 1.5
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 2.1
135153 AI093155 Hs.95420 JM27 protein 4.4
135181 BE250865 Hs.279529 px19-like protein 14.9
135199 AA477514 Hs.96247 translin-associated factor X 1.3
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI 1.7
135214 T78802 Hs.96560 hypothetical protein FLJ11656 6.1
135243 BE463721 Hs.97101 putative G protein-coupled receptor 2.7
135245 AI028767 Hs.262603 ESTs 12i
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1
135274 AA448460 Hs.112017 6E36gene
135294 AA150320 Hs.9800 protein kinase Njmu-R1
135295 AI090838 Hs.98006 ESTs
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr
135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HE
135389 U05237 Hs.99872 fetal Alzheimer antigen
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r
302256 AA857131 Hs.171595 HIV TAT specific factor 1
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2,
303135 AW592789 Hs.279474 HSPC070 protein
303686 AK000714 Hs.109441 MSTP033 protein
310085 R43191 Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq
315518 AA808229 Hs.167771 ESTs
317781 NM_007057 Hs.42650 ZW10 interactor
320836 AI268997 Hs.197289 rab3 GTPase-activating protein, non-cata
321114 AA902256 Hs.78979 Golgi apparatus protein 1
322221 N24236 Hs.179662 nucleosome assembly protein 1-like 1
322474 AF118083 Hs.29494 PRO1912 protein
322556 BE041451 Hs.177507 hypothetical protein
323541 AF292100 Hs.104613 RP42 homolog
407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo
408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22
408813 AI580090 Hs.48295 RNA helicase family
409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti
413670 AB000115 Hs.75470 hypothetical protein, expressed in osteo
414108 AI267592 Hs.75761 SFRS protein kinase 1
414846 AW3044S4 Hs.77495 UBX domain-containing 1
416980 AA381133 Hs.8a684 high-mobility group (nonhistone chromoso
417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate
418283 S79895 Hs.83942 cathepsin K (pycnodysostosis)
418467 NM_006910 Hs.85273 retinoblastoma-binding protein 6
420269 U72937 Hs.96254 alpha thalassemia/mental retardation syn
420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen
421225 AA463798 Hs.102696 MCT-I protein
421642 AF172066 Hs.106346 retinoic acid repressible protein
421828 AW891965 Hs.279789 histone deacetylase 3
421983 AI252640 Hs.110364 peptidylprolyl isomerase C (cyclophilin
422052 AA302744 Hs.104518 ESTs
422055 NM_014320 Hs.111029 putative heme-binding protein
423750 AF165883 Hs.298229 prefoldin 2
424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105
425182 AF041259 Hs.155040 zinc finger protein 217
425284 AF155568 Hs.156489 NS1-associated protein 1
426372 BE304680 Hs.169531 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
428049 AW1B3765 Hs.182238 GW128 protein
428477 AW500533 Hs.11482 splicing factor, arginine/serine-rich 11
437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcinoma-as
441560 F13386 Hs.7888 Homo sapiens clone 23736 mRNA sequence
445580 AF167572 Hs.12912 skb1 (S. pombe) homolog
446999 AA151520 Hs.334822 hypothetical protein MGC4485
447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal)
447778 BE620592 Hs.71190 ESTs, Weakly similar to S16506 hypotheti
448873 NM_003677 Hs.22393 density-regulated protein
449687 W68520 Hs.331328 intermediate filament protein syncoilin
450701 H3g960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA
450703 AA011202 Hs.184771 nuclear factor I/C (CCAAT-binding transc
452461 N76223 Hs.108106 transcription factor
452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 fis, clone AD
453157 AF077036 Hs.31989 DKFZP58661722 protein
453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp

100833 AF135168 Hs.108802 N-ethylmaleimide-sensitive factor
102481 U50360 gb:Human calcium, calmodulin-dependent p
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 7.9
103549 BE270465 Hs.78793 protein kinase C, zeta 2.0
104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein 5.3
110018 AW579842 Hs.104557 hypothetical protein FLJ10697 2.0
115008 AK001827 Hs.87889 helicase-moi 5.7
119075 M10905 Hs.287820 fibronectin 1 1.3
119615 AL034423 Hs.75875 ubiquitin-conjugating enzyme E2 variant 2.9
125006 BE055136 Hs.145696 splicing factor (CC1.3) 1.7
127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4
129209 R62676 Hs.17820 Rho-associated, coiled-coil containing p 5.2
129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5
130182 BE267033 Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo 11.0
130365 W56119 Hs.155103 eukaryotic translation initiation factor 3.3
131135 NM_016569 Hs.267182 TBX3-iso protein 13
131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 3.2
131881 AW361018 Hs.3383 upstream regulatory element binding prot 14.3
132726 N52298 Hs.55608 hypothetical protein MGC955 3.0
135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 27
409487 H19886 gb:yn57a05.r1 Soares adult brain N2b5H 2.3
416040 AW819158 Hs.2d9044 Homo sapiens cDNA FLJ12048 fis, clone HE 7.4
TABLE 4A



Table 4A shows the accession numbers for those Pkeys lacking UniGene IDs in Table 4. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from GenBank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The GenBank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number
Accession: Genbank accession numbers 
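
The three-column layout described above (Pkey, CAT number, Accession) can be read into a lookup keyed by Pkey. The following is a minimal sketch only, assuming each row is whitespace-separated with the Pkey first, the CAT number second, and one or more accession numbers after that. The function names `parse_table4a_row` and `build_lookup` are hypothetical and are not part of the original disclosure; the sample row is taken from the table.

```python
from typing import Dict, Iterable, List, Tuple


def parse_table4a_row(row: str) -> Tuple[str, str, List[str]]:
    """Split one Table 4A row into (pkey, cat_number, accessions).

    Assumes whitespace-separated fields: Pkey, CAT number, then accessions.
    """
    fields = row.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and an accession: {row!r}")
    pkey, cat_number, *accessions = fields
    return pkey, cat_number, accessions


def build_lookup(rows: Iterable[str]) -> Dict[str, Tuple[str, List[str]]]:
    """Map each Pkey to its CAT number and accession list."""
    lookup: Dict[str, Tuple[str, List[str]]] = {}
    for row in rows:
        pkey, cat_number, accessions = parse_table4a_row(row)
        lookup[pkey] = (cat_number, accessions)
    return lookup


# Sample row copied from Table 4A.
sample_rows = ["123315 714071_1 AA496369 AA496646"]
lookup = build_lookup(sample_rows)
print(lookup["123315"])
```

Rows whose accession list wraps onto continuation lines would need to be joined before parsing; that step is left out of this sketch.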



Pkey CAT number Accessions 



123615 
123619 
101445 
124385 
124417 
124482 
102461 
103M9 
110856 



120280 
113248 
120472 



30686.-15 

371681 1 
1650_-5 
656394 1 
1642364,1 
1657509J 
3128L-28 
11052.-2 
19346J4 . 



160212,1 
328626J 
44573J2 



129019 44573_2



120695 9683_3



122188 
121581 
122618 
109026 
123658 
123811 
125115 
125147 
118737 
120274 
113196 
120504 
120639 



275673.1 
283769_1 
305217.1 
150431.1 



AA509170 

AA602964 AA609200
M21259 

AI267847 N27351 
N34059 N46979 
N53935 N63950 
U50360 



AA992380 N33063 N21418 H79958 R21911 H79957

103797 109699_1 AA080912 AA075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA113913 AA113892
AA083821 AA134801 AA082953 AA070343 AA062835 AA075419 AA063293 AA071252 AA078900 AA082836 AW974305

AA190577 AA181657

T63857 AW971220 AA493469 T63699 

AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005
AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832
AI762688 AA988777 AA468892 AI356394 AW103813 AI539642 AA642789 AA656975 AW506512 AI981530 AW629970
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005
AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210
AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291466
AA398838 AA435847

AA416568 AA442889 AA417233 AA442223 
AA453641 AA454061 



AA157811 AA836869
genbank_AA609364 AA609364
genbank_AA620586 AA620586
genbank_T97341 T97341
NOT_FOUND_entrez_W38150
382979_1 AA199686 N73861
genbank_AA177061 AA177051
genbank_T57317 T57317
genbank_AA256837 AA256837
genbank_AA266942 AA266942



W38150 






120809 genbank_AA346495 AA346495
113702 genbank_T97307 T97307

129660 23162_1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441
AW043898 AI969102 AA405741 AI891983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432
AI038109 AA782478 AA910084 AI220384 AA781298 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441
AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 AA976699 AI687001
AI621107 AI865540 AA772107 C06286 AA319661 AA405992

101045 entrez_J05614 J05614

117247 genbank N21032 

110501 genbank H55748 

103392 entre?J04563X94563 

105032 genbank AA127818 

119513 NOT_FOUND_entrez W37933 

105445 genbank AA252395 

121514 genbank AA412112 

121558 genbank AA412497 

121911 genbank AA427950 

123315 714071_1 AA496369 AA496646 

114911 genbank AA236672 

409487 1134778_1 H19886 AW402806 T10231
TABLE 5: Figure 5 from BRCA 001 US



Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 
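
The column definitions above can be applied to individual rows of Table 5 as in the following sketch. It assumes whitespace-separated rows in which the last field is R1, the first two are Pkey and ExAccn, and the UniGene ID (when present) begins with "Hs.". The names `Table5Row`, `parse_table5_row` and `filter_by_ratio`, and the cutoff of 5.0 in the usage lines, are hypothetical illustrations and are not taken from the original disclosure. The sample rows are copied from the table.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Table5Row:
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # None when the row carries no "Hs." identifier
    title: str
    r1: float  # ratio of tumor to normal breast tissue


def parse_table5_row(row: str) -> Table5Row:
    """Parse one row: Pkey, ExAccn, optional UniGene ID, title words, R1."""
    tokens = row.split()
    if len(tokens) < 4:
        raise ValueError(f"row has too few fields: {row!r}")
    pkey, ex_accn = tokens[0], tokens[1]
    r1 = float(tokens[-1])  # R1 is always the final field
    rest = tokens[2:-1]
    unigene_id = None
    if rest and rest[0].startswith("Hs."):
        unigene_id = rest[0]
        rest = rest[1:]
    return Table5Row(pkey, ex_accn, unigene_id, " ".join(rest), r1)


def filter_by_ratio(rows: List[Table5Row], min_ratio: float) -> List[Table5Row]:
    """Keep rows whose tumor-to-normal ratio is at least min_ratio."""
    return [r for r in rows if r.r1 >= min_ratio]


sample = [
    "102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2",
    "103549 BE270465 Hs.78793 protein kinase C, zeta 7.9",
    "102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9",
]
parsed = [parse_table5_row(r) for r in sample]
strong = filter_by_ratio(parsed, 5.0)  # 5.0 is an arbitrary illustrative cutoff
print([r.pkey for r in strong])
```

Because gene titles can themselves end in a number (for example "PTK7 protein tyrosine kinase 7"), the sketch always treats the final field as R1. Rows in which the R1 value was lost in reproduction would be misread and should be checked by hand.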



Pkey ExAccn UnigenelD UnlgeneTitle R1 

100114 X02308 Hs.82962 thymklylate synthetase 2.9 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasdcBn 7.5 

100154 H60720 Hs.81892 KIAA01 01 gene product 9.2 

100335 AW247529 Hs.6793 platelet-actjvating factor acetylhydrola 2.7 

100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 5.7 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 9
100668 L05424 Hs.169610 CD44 antigen (homing function and Indian 7.6
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 8.2
101045 J05614 gb:Human proliferating cell nuclear anti 5
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 3.4
101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxid 6.3
101580 NM_012151 Hs.83363 coagulation factor VIII-associated (intr 5.7
101592 AF064853 Hs.91299 guanine nucleotide binding protein ( 5.6
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9
101810 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 3.2
101983 AI904232 Hs.75323 prohibitin 8.4
102107 BE258602 Hs.182366 heat shock protein 75 1.4
102165 BE313280 Hs.159627 death associated protein 3 4.6
102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regu 4.3
102217 AA829978 Hs.301613 JTV1 gene 6.7
102220 U24389 Hs.65436 lysosomal 4.3
102302 AA306342 Hs.69171 protein kinase C-like 2 2.7
102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 2
102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2
102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9
102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3
102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8
102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 4.3
102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, ty 6
102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9
102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 2.3
102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinas 6.4
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 5.6
103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stro 4.5
103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos 3.1
103178 AA205475 Hs.275865 ribosomal protein S18 9.9
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8
103238 AI369285 Hs.75189 death-associated protein 5.6
103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit, 9.7
103549 BE270465 Hs.78793 protein kinase C, zeta 7.9
103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to 6.5
104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp 6.3
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9
104846 AI250789 Hs.32478 ESTs 5.6
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kin 12.3
104867 AA278898 Hs.225979 hypothetical protein similar to small G 2
104886 AW015316 Hs.23165 ESTs 17.7
104909 AW408164 H5^49184 transcription factor 19 (SC1) 5
104916 AW958157 Hs.155489 NS1-associated protein 1 17
104919 AA026880 Hs.25252 prolactin receptor 1.4
104974 Y12059 Hs.278675 bromodomain-containing 4 1.4
104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 7.2
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3
105039 AA907305 Hs.36475 ESTs 2.5
105079 AA151342 Hs.12677 CGI-147 protein 9.5
105088 H5B589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL 2.2
105393 AF167570 Hs.256583 interleukin enhancer binding factor 3, 9 5.4
105547 AA262640 Hs.27445 unknown 9.3
105564 BE616694 Hs.288042 hypothetical protein FLJ14299 1.4
105658 AA985190 Hs.246875 hypothetical protein FLJ20059 9.4
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5
105856 AF151066 Hs.281428 hypothetical protein 2.9
105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cyclophilin 5.2
106094 AA533491 Hs.23317 hypothetical protein FLJ14681 6.8
106350 AK001404 Hs.194698 cyclin B2 57
106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 6.3
106610 AA458882 Hs.79732 fibulin 1 7.9
106624 NM_003595 Hs^6350 tyrosylprotein sulfotransferase 2 7.7
106713 BE614802 Hs.184352 hypothetical protein FLJ12549 4.5
106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to 16.2
106846 AB037744 Hs.34892 KIAA1323 protein 2.2
106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, 16.8
106973 BE156256 Hs.11923 hypothetical protein 6.6
107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukem 1.8
107197 W15477 Hs.84639 glioma pathogenesis-related protein 6.1
107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.4
107901 L42612 Hs.335952 keratin 6B 2.5
107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 2.2
107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 67
108274 AF129535 Hs.272027 F-box only protein 5 7.1
108647 BE546947 Hs.44276 homeobox C10 9.8
108695 AB029000 Hs.70823 KIAA1077 protein 7.2
108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4
109011 AA156542 Hs.72127 ESTs 14
109068 AA164293 HsJ2545 ESTs 2.9
109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9
109468 NM_015310 Hs.6763 KIAA0942 protein 3.2
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2
110330 AI288666 Hs.16621 DKFZP434I116 protein 6.2
110501 H55748 gb:yq94aOU1 Soares fetal liver spleen 6.1
110742 AW190338 Hs.28029 hypothetical protein MGC11258 7.6
110762 BE044245 Hs.30011 hypothetical protein MGC2963 9.3
110856 AA992380 gb:ot37g06.s1 Soares_testis_NHT Homo sap 2.3
110958 NM_005864 Hs.24587 signal transduction protein (SH3 contain 6.7
111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMAN Z 3.6
111179 AK000136 Hs.10760 asporin (LRR class 1) 7.1
111239 N90956 Hs.17230 hypothetical protein FLJ22087 7S
111285 AA778711 Hs.4310 eukaryotic translation initiation factor 6.9
111392 W46342 Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA, 8.4
111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (fr 10.6
112244 AB029000 Hs.70823 KIAA1077 protein 14.6
112995 AA737033 Hs.7155 ESTs, Moderately similar to 2115357A TYK 5.6
113777 BE266947 Hs.10590 zinc finger protein 313 13.4
113791 AI269096 Hs.135578 chitobiase, di-N-acetyl- 1.3
113811 BE207480 Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone H 3.1
113834 T26483 Hs.6059 EGF-containing fibulin-like extracellula 11.3
113868 W57902 Hs.90744 proteasome (prosome, macropain) 26S subu 27
113870 AL079314 Hs.16537 hypothetical protein, similar to (U06944 6.1
113923 AW953484 Hs.3849 hypothetical protein FLJ22041 similar to 1.9
114275 AW515443 Hs.306117 KIAA0305 protein 15.8
114895 AA236177 Hs.75591 KIAA0887 protein 7.1
114965 AI733881 Hs.72472 BMP-R1B 2.3
115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDN 11.8
115278 AK002163 Hs.301724 hypothetical protein FLJ11301 1.5

115291 BE545072 Hs.122579 hypothetical protein FLJ10461 6.2
115652 BE093589 Hs.38178 hypothetical protein FLJ23468 10.6
115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type rece 6.8
115941 AI867451 Hs.46679 hypothetical protein FLJ20739 5.5
115968 AB037753 Hs.62767 KIAA1332 protein 9.8
115011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDN 2.4
116417 AW499664 Hs.12484 Human clone 23826 mRNA sequence 7.4
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1
116637 AK001043 Hs.92033 integrin-linked kinase-associated serine 2.7
117132 AI393666 Hs.42315 p10-binding protein 5.2
117881 AF161470 Hs.260622 butyrate-induced transcript 1 5.7
118528 AI949952 Hs.49397 ESTs 7.4
119075 M10905 Hs.287820 fibronectin 1 5.7
119265 BE539706 Hs.285363 ESTs 1.4
119349 T65004 Hs.163561 ESTs 8.4
119403 AL117554 Hs.119908 nucleolar protein NOP5/NOP58 6.7
119789 BE393948 Hs.50915 kallikrein 5 (KLK5; KLK-L2; stratum corn 9.2
120206 H25735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7
120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9
120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2
120325 AA195651 Hs.104106 ESTs 6.4
120327 AK000292 Hs.278732 hypothetical protein FLJ20285 16.1
120349 AW969481 Hs.55189 hypothetical protein 16.8
120356 AF000545 Hs.296433 putative purinergic receptor 28.1
120371 AA219305 Hs.104196 EST 12.4
120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homolo 9.7
120386 AW969665 Hs.154848 hypothetical protein DKFZp434D0127 32.6
120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN A 21.7
120395 AA134006 Hs.79306 eukaryotic translation initiation factor 12.5
120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (f 11.4
120472 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien 19.4
120484 AA253170 Hs.95473 EST 10.4
120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1_HUMAN ALU S 14.4
120582 BE244830 Hs.284228 ZNF135-like protein 10.2
120596 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 7.5
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52
120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8
120713 AW449855 Hs.96557 Homo sapiens cDNA FLJ12727 fis, clone NT 5.9
120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B c 7
120774 AI608909 Hs.193985 ESTs 7.8
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 6.8
120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens 4.4
120984 BE262951 Hs.99052 ESTs 5.6
121081 AA398721 Hs.186749 ESTs, Highly similar to 137550 mismatch 5.4
121408 AA406137 Hs.98019 EST 6
121505 AA494172 Hs.194417 ESTs 13.1
121508 AA402515 Hs.97887 ESTs 28
121513 AA416653 Hs.181510 ESTs 6.2
121549 AA412477 Hs.98142 EST 7.4
121558 AA412497 gb:zt95g12.s1 Soares_testis_NHT Homo sap 2.8
121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8
121744 AA398784 Hs.97514 ESTs 7.1
121748 BE536911 Hs.234545 hypothetical protein NUF2R 195
121773 AB033022 Hs.158654 KIAA1196 protein 7.9
121832 AW340797 Hs.98434 ESTs 5.8
121839 AA425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein 5
121882 AA426376 Hs.98459 ESTs 5
121911 AA427950 gb:zw50K)2.s1 Soares_total_fetus_Nb2HF8_ 7.2
121999 AA430211 Hs.98668 EST 6.4
122013 AA431085 Hs.98706 ESTs 6.5
122036 W92142 Hs.271953 ESTs, Weakly similar to ALU5_HUMAN ALU S 13.1
122356 AA443794 Hs.98390 ESTs 7Z
122371 AA868555 Hs.178222 ESTs 5
122372 AA446008 Hs.336677 EST 7.6
122460 AW418788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 9.7

122490 AA44d349 Hs.238151 EST 6.1
122492 AA448417 Hs.104990 ESTs 5.4
122510 AA449232 Hs.99195 ESTs 11.2
122530 AW959741 Hs.40368 adaptor-related protein complex 1, sigma 10.1
122572 AA452601 Hs.99287 EST 11
122607 AA453518 Hs.98023 ESTs 61.5
122614 AA453630 Hs.99339 EST 10.7
122616 AA453638 Hs.161873 ESTs 107.3
122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1
122622 AA453987 Hs.144802 ESTs 5.6
122717 AA456859 Hs.178358 ESTs 8.5
122829 AW204530 Hs.99500 ESTs 81.8
122838 AA460584 Hs.334386 ESTs 75.3
122856 AI929374 Hs.75367 Src-like-adapter 5.8
122868 AF005216 Hs.115541 Janus kinase 2 (a protein tyrosine kinas 5.3
122907 AA470074 Hs.169896 ESTs 11.5
123016 AW338067 Hs.323231 Homo sapiens cDNA FLJ11946 fis, clone HE 2.8
123034 AL359571 Hs.44064 ninein (GSK3B interacting protein) 8.7
123136 AW451999 Hs.194024 ESTs 5.1
123152 AW601773 Hs.270259 ESTs 5.2
123394 AA731404 Hs.105510 ESTs 3.6
123466 AA599042 Hs.112503 EST 7.4
123486 BE019072 Hs.334802 Homo sapiens cDNA FLJ14680 fis, clone HT 2.4
123615 AA609170 gb:af12a12.s1 Soares_testis_NHT Homo sap 7.8
123735 NM_013241 Hs.95231 FH1/FH2 domain-containing protein 1 0
123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6
124006 AI147155 Hs.270016 ESTs 8.1
124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1
124440 AA532519 Hs.129043 Human DNA sequence from clone 989H11 on 7.8
124656 AW297702 Hs.102915 ESTs 8.3
124683 AA381661 Hs.119878 ESTs, Weakly similar to M3K9_HUMAN MITOG 7.9
124735 R22952 Hs.268685 ESTs 11.3
124761 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, 9
124768 AW368528 Hs.100855 ESTs 8.1
124788 R43543 Hs.100912 Homo sapiens cDNA: FLJ22726 fis, clone H 5.1
124811 R46068 Hs.288912 hypothetical protein FLJ22604 14.2
124812 R47948 Hs.188732 ESTs 7.9
124822 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PL 6.6
124860 R55763 Hs.101477 EST 23.9
124903 AW296713 Hs.221441 ESTs 32.4
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN 22.8
124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypot 6.1
125051 T79956 Hs.100588 EST 135.3
125056 T81310 Hs.100592 ESTs 5.4
125101 AI472068 Hs.286236 KIAA1856 protein 5.6
125115 T97341 gb7e57e05.s1 Soares fetal liver spleen 9.6
125280 AI123705 Hs.10e932 ESTs 8
127274 AW966158 Hs.58582 Homo sapiens cDNA FLJ12789 fis, clone NT 12.8
128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR14 [H.s 2.8
128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cDNA 1700 7.1
128691 W27939 Hs.103834 hypothetical protein MGC5576 7.7
128772 BE302796 Hs.105097 thymidine kinase 1, soluble 5.3
128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypept 53.9
128797 NM_002975 Hs.105927 stem cell growth factor; lymphocyte secr 13.3
128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 3
128891 F34855 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, com 133
128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 7.2
128975 BE560779 Hs.284233 NICE-5 protein 14
128995 AI816224 Hs.107747 DKFZP566C243 protein 1.9
129019 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien 2.9
129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypotheti 5
129088 AA744610 Hs.194431 palladin 17.1
129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9
129198 N57532 Hs.109315 KIAA1415 protein 5.8
129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6
129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassi 6.7
129372 NM_016039 Hs.110803 CGI-99 protein 2
129404 AI267700 Hs.317584 ESTs 5
129482 AA188185 Hs.289043 spindlin 6.7

129559 W01296 Hs.11360 hypothetical protein FLJ14784 7.5
129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6.8
1296^ AK000398 Hs.11747 hypothetical protein FLJ20391 3.8
129649 AD000092 Hs.16488 calreticulin 3.3
129680 U03749 gb:Human chromogranin A (CHGA) gene, pro 14.1
129689 AW748482 Hs.77873 B7 homolog 3 2.6
129702 AI304965 Hs.12035 ESTs, Weakly similar to 138022 hypotheti 7.4
129720 AA156214 Hs.12152 APMCF1 protein 2
130010 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 1.6
130097 Ali}46962 Hs.14845 forkhead box O3A 2.8
130135 AA311426 Hs.21635 tubulin, gamma 1 6.1
130211 NM_003358 Hs.23703 ESTs, Moderately similar to CEGT_HUMAN C 1.6
130242 X79201 Hs.153221 synovial sarcoma, translocated to X chro 5.4
130359 NM_013449 Hs.277401 bromodomain adjacent to zinc finger doma 8.5
130365 W56119 Hs.155103 eukaryotic translation initiation factor 11
130448 BE513202 Hs.15589 PPAR binding protein 3.9
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 33.6
130471 AL121438 Hs.183705 adducin 1 (alpha) 2.7
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1
130542 U64675 Hs.179825 RAN binding protein 2-like 1 7.8
130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4
130556 AI907018 Hs.15977 Empirically selected from AFFX single pr 4.7
130567 AA383092 Hs.1608 replication protein A3 (14kD) 7.9
130574 AF083208 Hs.16178 apoptosis antagonizing transcription fac 1.2
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 12.1
130667 BE246961 Hs.17639 Homo sapiens ubiquitin protein ligase (U 13.9
130693 R68537 Hs.17962 ESTs 2
130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevi 3.1
130757 AL036067 Hs.18925 protein x 0001 5.7
130880 BE514434 Hs.20830 kinesin-like 2 2.1
130944 BE382657 Hs.21486 signal transducer and activator of trans 5.4
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4
131060 AA194422 Hs.22564 myosin VI 5.1
131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxid 7
131135 NM_016569 Hs.267182 TBX3-iso protein 3.3
131185 BE280074 Hs.23960 cyclin B1 5.8
131225 H62087 Hs.31659 thyroid hormone receptor-associated prot 7.5
131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8
131283 X80038 Hs.339713 Homo sapiens clone F19374 APOE-C2 gene 13
131569 Al^9951 Hs.271623 nucleoporin 50kD 5
131643 AW410601 Hs.30026 HSPC182 protein 2.9
131714 AA642831 Hs.31016 putative DNA binding protein 2.9
131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransf 3.4
131760 X76732 Hs.3164 nucleobindin 2 2.9
131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone Iff 7.9
131885 BE502341 Hs.3402 ESTs 137
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 8.7
131905 AA179298 Hs.3439 stomatin-like 2 11.3
131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3
131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7
132203 NM_004782 Hs.194714 synaptosomal-associated protein, 29kD 7.8
132273 AA227710 Hs.43658 DKFZP586L151 protein 10
132288 N36110 Hs.305971 solute carrier family 2 (facilitated glu 9.2
132294 AB023191 Hs.44131 KIAA0974 protein 2
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein 1Z5
132370 AW572805 Hs.46645 ESTs 28.3
132384 AA312135 Hs.46967 HSPC034 protein 6.1
132450 AA100012 Hs.48827 hypothetical protein FLJ12085 8.6
132465 AW169847 Hs.49169 KIAA1634 protein 6.1
132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 7.1
132574 AW631437 Hs.5184 TH1 drosophila homolog 14
132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 12.4
132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 3.7
132726 N52298 Hs.55608 hypothetical protein MGC955 143
132731 AI189075 Hs.301872 hypothetical protein MGC4840 5.9

132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 6.4
132773 AA459713 Hs.295901 KIAA0493 protein 14.6
132798 AI026701 Hs.5716 KIAA0310 gene product 2.5
132810 AB007944 Hs.5737 KIAA0475 gene product 4.2
132833 U78525 Hs.57783 eukaryotic translation initiation factor 6.1
132842 NM_016154 Hs.279771 Homo sapiens clone PP1596 unknown mRNA 7.1
132851 U09716 Hs.287912 lectin, mannose-binding, 1 6.1
132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary fac 2.7
132941 AI817185 Hs.6120 hypothetical protein FLJ13222 2.1
132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HE 3.5
132980 AA040696 Hs.62016 ESTs 1.3
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 4.4
133177 X97795 Hs.66718 RAD54 (S. cerevisiae)-like 4.4
133208 AI801777 Hs.6774 ESTs 5.5
133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, 1.3
133266 AI160873 Hs.69233 zinc finger protein 16.1
133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKH 12.2
133285 M76477 Hs.289082 GM2 ganglioside activator protein 10.4
133390 AI950382 Hs.72660 phosphatidylserine receptor 5.7
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subu 1.7
133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cer 2.6
133621 NM_004893 Hs.75258 H2A histone family, member Y 13.5
133720 L27841 Hs.75737 pericentriolar material 1 6.7
133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal prot 5.4
133784 BE622743 Hs.301064 arfaptin 1 12.1
133791 M34338 Hs.76244 spermidine synthase 9.7
133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 1.3
133822 D50525 Hs.699 peptidylprolyl isomerase B (cyclophilin 9.7
133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 4.2
133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 5
133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin 9.1
133924 D86326 Hs.325948 vesicle docking protein p115 1.8
133959 X81789 Hs.77897 splicing factor 3a, subunit 3, 60kD 10.4
133989 AIJ)40328 Hs.78202 SWI/SNF related, matrix associated, acti 2.6
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 13
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, 10.3
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, 6.7
134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 5.5
134379 AW362124 Hs.323193 hypothetical protein MGC3222 5.8
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9
134421 AU077196 Hs.82985 collagen, type V, alpha 2 6.7
134480 NM_005000 Hs.83915 Empirically selected from AFFX single pr 6.2
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 1.4
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8
134751 AW630803 Hs.89497 lamin B1 6.1
134790 BE002798 Hs.287850 integral membrane protein 1 1.2
134806 AD001528 Hs.89718 spermine synthase 2.6
134850 AI701162 Hs.90207 hypothetical protein MGC11138 9.1
134859 D26488 Hs.90315 KIAA0007 protein 13.3
134971 AI097346 Hs.286049 phosphoserine aminotransferase 2
135181 BE250865 Hs.279529 px19-like protein 14.9
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI 1.7
135245 AI028767 Hs.262603 ESTs 12.2
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked 7.6
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein 5.8
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr 5.7
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c 5.3
317781 NM_007057 Hs.42650 ZW10 interactor 2.8
321114 AA902256 Hs.78979 Golgi apparatus protein 1 5.5
322556 BE041451 Hs.177507 hypothetical protein 2.9
420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.3
424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105 7
425182 AF041259 Hs.155040 zinc finger protein 217 2.3

446999 AA151520 Hs.334822 hypothetical protein MGC4485 7.5
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.6
452461 N78223 Hs.108106 transcription factor 4.7
453157 AF077036 H5^1989 DKFZP58661722 protein 12.1

TABLE 5A

Table 5A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 5. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
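
The probeset-to-cluster relationship described above can be illustrated with a short Python sketch. It is not part of the original filing. It assumes the rows are plain text in the "Pkey CAT number Accessions" layout, with a six-digit Pkey opening each row and long accession lists wrapping onto continuation lines; the names ClusterRow and parse_cluster_rows are hypothetical.

```python
from typing import Dict, List, NamedTuple


class ClusterRow(NamedTuple):
    """One Table 5A-style row (hypothetical container, not from the source)."""
    pkey: str              # unique Eos probeset identifier
    cat_number: str        # gene cluster number
    accessions: List[str]  # Genbank accessions comprising the cluster


def parse_cluster_rows(text: str) -> Dict[str, ClusterRow]:
    """Map each pkey to its cluster number and accession list.

    Assumption: a row starts with a six-digit pkey followed by a CAT number;
    any other non-empty line continues the accession list of the previous row.
    """
    rows: Dict[str, ClusterRow] = {}
    current = None
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        if tokens[0].isdigit() and len(tokens[0]) == 6 and len(tokens) >= 3:
            current = ClusterRow(tokens[0], tokens[1], list(tokens[2:]))
            rows[current.pkey] = current
        elif current is not None:
            # Continuation line: accessions wrapped from the previous row.
            current.accessions.extend(tokens)
    return rows


sample = """Pkey CAT number Accessions
124385 656394_1 AI267847 N27351
120695 9683_3 AA976503 AI917802
AW052210 AA970201
122618 305217_1 AA453641 AA454061
"""
clusters = parse_cluster_rows(sample)
```

Looking up clusters["120695"].accessions then returns the full accession list for that probeset, including the wrapped continuation line.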



Pkey CAT number Accessions
123615 30686^-15 AA609170
124385 656394_1 AI267847 N27351
110856 19346^14 AA992380 N33063 N21418 H79958 R21911 H79957
120472 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586
AW150329 AI653632 AI762688 AA968777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66058 AI002839 R87840 AA300207 AW959581 T63226
F04005
129019 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701
AW813456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226
F04005
120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458
AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
122618 305217_1 AA453641 AA454061
125115 genbank_T97341 T97341
120809 genbank_AA346495 AA346495
129680 2Z^B2J U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054
AI273831 W32275 AI584185 005724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214
AA894441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304
AA976699 AI687001 AI621107 AI865540 AA772107 C06286 AA319661 AA405992
101045 entrez_J05614 J05614
110501 genbank_H55748 H55748
121558 genbank_AA412497 AA412497
121911 genbank_AA427950 AA427950




TABLE 6: Figure 6 from BRCA 001 US
Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
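
As an illustration of how the R1 column might be used, the following Python sketch parses rows in the "Pkey ExAccn UnigeneID Unigene Title R1" layout and keeps those whose tumor-to-normal ratio meets a cutoff. It is not part of the original filing. The cutoff value, the ExpressionRow container, and the function names are hypothetical, and the sketch assumes the UnigeneID column, when present, begins with "Hs." and that R1 is the last whitespace-separated field.

```python
from typing import Iterable, List, NamedTuple, Optional


class ExpressionRow(NamedTuple):
    """One expression-table row (hypothetical container, not from the source)."""
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # None for rows that carry only a gb: title
    title: str
    r1: float                  # ratio of tumor to normal breast tissue


def parse_row(line: str) -> ExpressionRow:
    """Split a row into its columns; the title may contain spaces."""
    tokens = line.split()
    pkey, ex_accn = tokens[0], tokens[1]
    r1 = float(tokens[-1])
    middle = tokens[2:-1]
    unigene_id = None
    if middle and middle[0].startswith("Hs."):
        unigene_id = middle[0]
        middle = middle[1:]
    return ExpressionRow(pkey, ex_accn, unigene_id, " ".join(middle), r1)


def upregulated(rows: Iterable[ExpressionRow], min_ratio: float) -> List[ExpressionRow]:
    """Return rows with R1 >= min_ratio, highest ratio first."""
    kept = [row for row in rows if row.r1 >= min_ratio]
    return sorted(kept, key=lambda row: row.r1, reverse=True)


lines = [
    "100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2",
    "120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8",
    "134169 AI690916 Hs.178137 transducer of ERBB2, 1 1.2",
]
parsed = [parse_row(line) for line in lines]
strong = upregulated(parsed, min_ratio=10.0)  # 10.0 is an illustrative cutoff only
```

Rows without a UnigeneID, such as 120695, keep their gb: description as the title, which is the case Table 6A addresses.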



Pkey ExAccn UnigeneID Unigene Title R1
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9
102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8
105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), 2.6
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5
106373 AW5038Q7 Hs.21907 histone acetyltransferase 1.8
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2
119260 AK001724 Hs.102950 coat protein gamma-cop 3.2
120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7
120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9
120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52
120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 6.8
121508 AA402515 Hs.97887 ESTs 28
122607 AA453518 Hs.98023 ESTs 61.5
122616 AA453638 Hs.161873 ESTs 107.3
122618 AA453641 gb:zx48e05.s1 Soares_testis_NHT Homo sap 31.1
122829 AW204530 Hs.99500 ESTs 81.8
122838 AA460584 Hs.334386 ESTs 75.3
123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6
124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1
124860 R65763 Hs.101477 EST 23.9
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN !!!! 22.8
125051 T79956 Hs.100588 EST 135.3
128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypept 53.9
129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9
129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6
129689 AW748482 Hs.77873 B7 homolog 3 2.6
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4
131643 AW410601 Hs.30026 HSPC182 protein 2.9
131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7
132370 AW572805 Hs.46645 ESTs 28.3
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 4.4
133266 AI160873 Hs.69233 zinc finger protein 16.1
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5
134169 AI690916 Hs.178137 transducer of ERBB2, 1 1.2
134219 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 1.9
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8
134975 R50333 Hs.92186 Leman coiled-coil protein 2.6
135181 BE250865 Hs.279529 px19-like protein 145
322556 BE041451 Hs.177507 hypothetical protein 2.9

TABLE 6A

Table 6A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 6. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accessions
124385 656394_1 AI267847 N27351
120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603
AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
122618 305217_1 AA453641 AA454061

TABLE 7: Figure 7 from BRCA 001-1 US

Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
ORF struct info: Structural characterization of open reading frame for the sequence of the gene

Pkey ExAccn UnigeneID Unigene Title R1 ORF struct info


100113 NM_001269 LI- QA^ AC chromosome condensation 1 Z.J TM
100114 X02308 Hs.82962 thymidylate synthetase 9 Q other
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi other
100140 BElo54y9 Hs.2471 KIAA0020 gene product 1 Q TM
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7fi other
100154 no0720 Un 0 4QQ1 KiAAUiui gene product Q 9 other
100163 W44o7i Hs.124 gene predicted from cDNA with a complete 1 A other
100220 AWD15534 nS^17493 annexin A2 c. other
100265 Djo521 ns.1 14Cj9d KiAAUUf f protein ■ •9 other


100271 HF1R0081 


Hs 256290 


S100 calfitum-bfndina onotsln A1 1 fcafaiz 


13.5 


other 


100275 BE242802 


Hs.154797 


KIAA0090 protein 


5.1 


other 


100323 D50920 


Hs.23106 


K1AA0130 gene product 


1.9 


TM 


100335 AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


2.7 


other 


100364 NM_004341 


Hs.154868 


carbamoyi-phosphate synthetase 2, aspart 


2 


other 


100372 NM^014791 


Hs.1 84339 


KIAA0175 gene product 


2.6 


other 


100393 D84145 


Hs.39913 


novel RGD-containIng protein 


3.2 


other 


100400 AW954324 


Hs.75790 


phosphatidylinositol glycan, class C 


1.5 


other 


100418 D86978 


Hs.84790 


KIAA0225 protein 


2 


other 


100462 M65028 


Hs.81361 


heterogeneous nuclear hbonudeoprptein 


2.9 


other 


100518 NMJ)04415 


Hs.74316 . 


desmoplakin (DPI, DPIl) 


1.9 


other 


100666 L05424 


Hs.169610 


CD44 antigen (homing functton and Indian 


5.7 


other 


100667 L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


9 


? 


100668 L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


7.7 


other 


100678 AW502935 


Hs.740 


PTK2 protein tyrosine kinase 2


53.2 


other 


100783 AF078847 


Hs.191356 


general transcription factor IIH, polype 


6 


other 


100892 BE245294 


Hs.180789


S164 protein


1.7 


? 


100945 AF002225 


Hs.180686


ubiquitin protein ligase E3A (human papi


1.5 


other 


100969 AA157634


Hs.79172 


solute carrier family 25 (mitochondrial


6.3 


other 


100988 AK000405 


Hs.76480 


ubiquitin-like 4


11.4 


? 


100999 H38765 


Hs.80706 


diaphorase (NADH/NADPH) (cytochrome b-5 


1.6 


other 


101031 J05070 


Hs.151738 


matrix metalloproteinase 9 (gelatinase B 


8.4 


other 


101045 J05614 




gb:Human proliferating cell nuclear anti 


5 


? 


101077 N99692 


Hs.75227 


Empirically selected from AFFX single pr 


2.6 


other 


101093 L06419 


Hs.75093 


procollagen-lysine, 2-oxoglutarate 5-dio


1.4 


? 


101186 AA020955 


Hs.179881 


core-binding factor, beta subunit


2 


TM 


101216 AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK


1.8 


other 


101228 AA333387 


Hs.82916 


chaperonin containing TCP1, subunit 6A (


1.7 


TM 


101247 AA132666 


Hs.78802 


glycogen synthase kinase 3 beta 


1.9 


other 


101249 L18964 


Hs.1904 


protein kinase C, iota 


1.5 


other 


101332 J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD)


5.3 


other 


101352 AI494299 


Hs.16297 


COX17 (yeast) homolog, cytochrome c oxid


4.2 


other 


101396 BE267931 


Hs.78996 


proliferating cell nuclear antigen 


1.9 


TM 


101445 M21259 




gb:Human Alu repeats in the region 5' to


1.6 


TM 


101470 NM_000546


Hs.1846


tumor protein p53 (Li-Fraumeni syndrome) 


2.5 


other 


101478 NM_002890


Hs.758 


RAS p21 protein activator (GTPase activa


5.5 


other 



101483 M24486 


Hs.76768 


procollagen-proline, 2-oxoglutarate 4-di


2.1 


other 


101540 J04977 


Hs.84981 


X-ray repair complementing defective rep 


1.6 


other 


101573 AW248421 


Hs.250758 


proteasome (prosome, macropain) 26S subu 


5.7 


other 


101580 NM_012151 


Hs.83363 


coagulation factor VIII-associated (intr


1.8 


other 


101592 AF064853 


Hs.91299 


guanine nucleotide binding protein (G pr 


5.6 


? 


101621 BE391804 


Hs.62661 


guanylate binding protein 1, interferon-


2.4 


other 


101702 AW504089 


Hs.179574 


protein phosphatase 2 (formerly 2A), reg


1.3 


other 


101734 M74099 


Hs.147049 


cut (Drosophila)-like 1 (CCAAT displacem


2.1 


? 


101759 M80244 


Hs.184601 


solute carrier family 7 (cationic amino


5 


TM 


101767 M81057 


Hs.180884 


carboxypeptidase B1 (tissue)


14.4 


SS,


101782 AA306495 


Hs.1869 


phosphoglucomutase 1 


5.2 


other 


101805 AW409747 


Hs.75612


stress-induced-phosphoprotein 1 (Hsp70/H


8.6 


other 


101806 AA586894 


Hs.112408


S100 calcium-binding protein A7 (psorias


8.9 


SS,TM 


101810 NM_000318


Hs.180612 


peroxisomal membrane protein 3 (35kD, Ze


3.2 


TM 


101879 AA176374 


Hs.243886 


nuclear autoantigenic sperm protein (his


1.6 


other 


101911 AA441787 


Hs.119689


glycoprotein hormones, alpha polypeptide


31.3 


? 


101920 AF182645 


Hs.6024 


IK cytokine, down-regulator of HLA II


1.8 


other 


101973 U41514 


Hs.80120 


UDP-N-acetyl-alpha-D-galactosamine:polyp


2.4 


other 


102009 BE245149 


Hs.82643 


protein tyrosine kinase 9 


1.3 


other 


102036 BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi


2 


? 


102083 T35901 


Hs.75117 


interleukin enhancer binding factor 2, 4


1.6 


other 


102107 BE258602 


Hs.182366 


heat shock protein 75


1.4 


other 


102123 NM_001809


Hs.1594


centromere protein A (17kD)


1.8 


other 


102165 BE313280 


Hs.159527


death associated protein 3 


4.6 


? 


102198 AW950852 


Hs.74598 


polymerase (DNA directed), delta 2, regu


4.4 


? 


102217 AA829978 


Hs.301613 


JTV1 gene 


6.7 


other 


102220 U24389 


Hs.65436 


lysosomal 


4.4 


TM 


102234 AW163390 


Hs.278554 


heterochromatin-like protein 1 


1.9 


TM 


102260 AU)39104 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1, impor


4.4 


other 


102302 AA306342 


Hs.69171 


protein kinase C-like 2


17 


? 


102330 BE298063 


Hs.77254 


chromobox homolog 1 (Drosophila HP1 beta


15 


other 


102339 BE378432 


Hs.95577 


cyclin-dependent kinase 4


2.3 


TM 


102348 U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


2 


TM 


102349 AU077055 


Hs.289107 


baculoviral IAP repeat-containing 2


3.2 


other 


102369 U39840 


Hs.299867 


hepatocyte nuclear factor 3, alpha


2 


other 


102374 U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


6.2 


other 


102391 AA296874


Hs.77494 


deoxyguanosine kinase 


1.5 


TM 


102455 U48705 


Hs.75562 


discoidin domain receptor family, member 


7 


other 


102465 NM_001359


Hs.81548 


2,4-dienoyl CoA reductase 1, mitochondri


1.8 


SS,


102488 U50939 


Hs.61828 


amyloid beta precursor protein-binding p 


1.5 


? 


102489 AL080116 


Hs.74420 


origin recognition complex, subunit 3 (y


3.3 


other 


102494 AI188137 


Hs.75193 


COP9 homolog


2.1 


other 


102501 AF217197 


Hs.74562 


siah binding protein 1; FBP interacting


3.2 


other 


102522 BE250944 


Hs.183556 


solute carrier family 1 (neutral amino a


2.8 


? 


102532 AF040253 


Hs.70186 


suppressor of Ty (S.cerevisiae) 5 homolo 


5.7 


? 


102564 U59423 


Hs.79067 


MAD (mothers against decapentaplegic, Dr 


2.3 


other 


102568 W81489 


Hs^025 


RAB31, member RAS oncogene family 


5.3


other 


102580 U60808 


Hs.152981 


CDP-diacylglycerol synthase (phosphatida


2.1 


other 


102581 AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2


1.6 


? 


102582 U61232 


Hs.32675


tubulin-specific chaperone e


2.1 


other 


102617 AW161453 


Hs.198767


COP9 (constitutive photomorphogenic, Ara


1.8 


other 


102618 AL037672 


Hs.81071 


extracellular matrix protein 1 


5.8 


other 


102627 AL021918 


Hs.158174 


zinc finger protein 184 (Kruppel-like)


1.3 


other 


102663 NM_002270


Hs.168075 


karyopherin (importin) beta 2


1.8 


TM 


102676 BE262989


Hs.12045 


putative protein 


2.3 


other 


102687 NM_007019 


Hs.93002 


ubiquitin carrier protein E2-C


4.4 


? 


102689 U96132 


Hs.171280 


hydroxyacyl-Coenzyme A dehydrogenase, ty 


6 


? 


102696 BE540274 


Hs.239 


forkhead box M1


4.2 


other 


102704 AU077058 


Hs.54089 


BRCA1 associated RING domain 1 


1.9 


other 


102705 T97490 


Hs.50002


small inducible cytokine subfamily A (Cy 


2.3 


SS,TM


102750 AB014460 


Hs.66196


nth (E.coli endonuclease III)-like 1


1.2 


TM


102801 BE252241 


Hs.38041 


pyridoxal (pyridoxine, vitamin B6) kinas


6.5 


other 


102812 U90549


Hs.236774 


high-mobility group (nonhistone chromoso


1.6 


other 


102827 BE244588 


Hs.6456 


chaperonin containing TCP1, subunit 2 (b


5.6 


TM 


102844 AV653790 


Hs.324275 


WW domain-containing protein 1 


1.3 


TM 


102868 X02419 


Hs.77274


plasminogen activator, urokinase 


4.4


other 


102925 BE440142 


Hs.2943 


signal recognition particle 19kD


1.9 


other 
102935 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept

102968 AU075611 Hs.154672 methylene tetrahydrofolate dehydrogenase

102983 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A)

102985 U95742 Hs.2707 G1 to S phase transition 1 

103023 AW500470 Hs.117950 multifunctional polypeptide similar to S

103038 AA926960 Hs.334883 CDC28 protein kinase 1 

103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11: stro

103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos

103089 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 

103177 BE244377 Hs.48876 farnesyl-diphosphate farnesyltransferase

103178 AA205475 Hs.275865 ribosomal protein S18 

103179 NM_001777 Hs.82685 CD47 antigen (Rh-related antigen, integr
103181 X69636 Hs^731 Homo sapiens, clone IMAGE:3448306, mRNA,
103185 NM_006825 Hs.74388 transmembrane protein (63kD), endoplasmi

103191 AA401039 Hs^03 protein phosphatase 4 (formerly X), cata

103193 NM_004766 Hs.75724 coatomer protein complex, subunit beta 2

103194 NM_004939 Hs.78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep
103206 X72755 Hs.77367 monokine induced by gamma interferon 
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (g

103232 X75962 Hs.129780 tumor necrosis factor receptor superfami

103238 AI369285 Hs.75189 death-associated protein

103297 NM_001545 Hs.9078 immature colon carcinoma transcript 1

103330 AI803447 Hs.77496 small nuclear ribonucleoprotein polypept

103349 X89059 gb:H.sapiens mRNA for unknown protein ex 

103376 AL036166 Hs.323378 coated vesicle membrane protein

103391 X94453 Hs.114366 pyrroline-5-carboxylate synthetase (glut

103392 X94563 gb:H.sapiens dbi/acbp gene exon 1 & 2.
103430 BE564090 Hs.20716 translocase of inner mitochondrial membr
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukem

103505 AL031224 Hs.33102 transcription factor AP-2 beta (activati

103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit,

103588 NM_006218 Hs^5701 phosphoinositide-3-kinase, catalytic, al

103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca

103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp 

103622 AA609685 Hs.278672 membrane component, chromosome 11, surfa

103727 AI878883 Hs.296381 growth factor receptor-bound protein 2

103754 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f

103780 AA094752 Hs.169992 hypothetical 43.2 Kd protein 

103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) m 

103797 AA080912 gb:zn04d03.r1 Stratagene hNT neuron (937

103813 AI042582 Hs.181271 CGI-120 protein 

103855 W02363 Hs.302267 hypothetical protein FLJ10330

103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to

104052 NM_002407 Hs.97644 mammaglobin 2

104079 AA251242 Hs.103238 ESTs

104174 AA478984 Hs.6451 PRO0659 protein 

104227 AB002343 Hs.98938 protocadherin alpha 9 

104275 AI751970 Hs.101067 GCN5 (general control of amino-acid synt

104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp

104370 AA324597 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT

104423 R83113 Hs.1432 protein kinase C substrate 80K-H 

104482 AB037762 Hs.44268 myelin gene expression factor 2 

104667 AI239923 Hs.30098 ESTs 

104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily

104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP (H.sapien

104806 AB023175 Hs.22982 KIAA0958 protein

104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 

104846 AI250789 Hs.32478 ESTs

104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kin

104867 AA278898 Hs.225979 hypothetical protein similar to small G

104871 T78044 Hs.58893 Homo sapiens mRNA; cDNA DKFZp564O2364 (f

104895 AW015318 Hs.23165 ESTs 

104909 AW408164 Hs.249184 transcription factor 19 (SC1)

104916 AW958157 Hs.155489 NS1-associated protein 1

104919 AA026880 Hs.25252 prolactin receptor

104930 AF043467 Hs.32893 neurexophilin 2 

104973 NM_015310 Hs.6763 KIAA0942 protein
104974 Y12059


Hs.278675 


bromodomain-containing 4


15 


other 


104975 AL136877 


Hs^758 


SMC4 (structural maintenance of chromoso


2.4 


other 


104978 AI199268 


Hs.19322 


Homo sapiens, Similar to RIKEN cDNA 2010 


7.3 


other 


104979 AA937934 


Hs.321062 


ESTs 


1.3 


other 


104994 AI499930


Hs^34885 


mitochondrial GTP binding protein 


3.6 


? 


105009 BE379584 


Hs.34789 


dolichyl-diphosphooligosaccharide-protei


5.6 


other 


105012 AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


3.4 


other 


105028 AI050715 


Hs.2331 


E2F transcription factor 5, p130-binding


2.2 


other 


105041 AB037716 


Hs.26204 


KIAA1295 protein


2.2 


other 


105045 BE242899 


Hs.129951 


speckle-type POZ protein


3.9 


? 


105079 AA151342 


Hs.12677 


CGI-147 protein


9.5 


TM 


105087 AA147884 


Hs.9812


Homo sapiens cDNA FLJ14388 fis, clone HE 


5.7 


other 


105088 H58589


Hs.35156 


Homo sapiens cDNA FLJ11027 fis, clone PL


22 


other 


105095 Z78407 


Hs.27023 


vesicle transport-related protein 


22 


other 


105110 BE387350 


Hs.33122


KIAA1160 protein


1.6 


other 


105126 AW975433 


Hs.36286 


ESTs 


6.4 


?


105127 AA045648 


Hs.301957 


nudix (nucleoside diphosphate linked moi 


2.2 


other 


105141 AA164687 


Hs.177576


mannosyl (alpha-1,3-)-glycoprotein beta-


2.8


other 


105158 AW976357 


Hs.234545 


hypothetical protein NUF2R 


2 


other 


105169 BE245294 


Hs.180789 


S164 protein


1.7 


other 


105186 AA191512 


Hs.28005 


Homo sapiens cDNA FLJ11309 fis, clone PL


45 


SS,TM


105254 AA071276 


Hs.19469


KIAA0859 protein 


2 


TM


105281 AA263143 


Hs.24596 


RAD51-interacting protein


2.9 


? 


105288 N99673 


Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ


15 


TM 


105302 AA700122 


Hs.3355 


sentrin-specific protease


8.2 


? 


105331 AW270037 


Hs.179507 


KIAA0779 protein


1.8 


SS, 


105359 NM_016015


Hs.8054 


CGI-68 protein 


8.4 


other 


105366 BE264645 


Hs.282093 


hypothetical protein FLJ21918


5.1 


other 


105373 AW887701 


Hs.32356 


hypothetical protein FLJ20626 


2.6 


other 


105374 BE242803 


Hs.262823 


hypothetical protein FLJ10326


2.2 


TM 


105387 AW592146 


Hs.108636 


membrane protein CH1


tz 


SS,TM


105393 AF167570 


Hs.256583


interleukin enhancer binding factor 3, 9


5.5 


SS, 


105399 BE366877 


Hs.334611 


Npw38-binding protein NpwBP


1.8 


other 


105400 AF198620 


Hs.65648 


RNA binding motif protein 8A 


1.6 


other 


105445 AA252395 




gb:zs12g10.s1 NCI_CGAP_GCB1 Homo sapiens


5.1 


? 


105507 BE268348 


Hs.226318 


CCR4-NOT transcription complex, subunit


1.6 


other 


105529 AA113449 


Hs.32471 


hypothetical protein FLJ20364 




other 


105530 AB023179 


Hs.9059 


KIAA0962 protein 


3.5 


other 


105547 AA262640 


Hs.27445 


unknown 


9.3 


other 


105564 BE615694 


Hs.288042


hypothetical protein FLJ14299


1.4 


other 


105595 AA579535 


Hs.18490


hypothetical protein FLJ20452 


10.9 


TM 


105597 AF054284 


Hs.334826 


splicing factor 3b, subunit 1, 155kD


2.9 


TM 


105608 AI808201 


Hs.287863 


hypothetical protein FLJ12475


1.7 


? 


105610 AA280072 


Hs.99872 


fetal Alzheimer antigen 


1.4 


other 


105617 AK000892 


Hs.4069 


glucocorticoid modulatory element bindin


1.7 


TM 


105620 AW302245 


Hs.181390 


casein kinase 1, gamma 2 


5.6 


other 


105656 AA985190 


Hsi46675 


hypothetical protein FLJ20059 


9.4 


other 


105697 AW499988 


Hs.27801 


zinc finger protein 278 


2 


TM 


105708 R26944 


Hs.180777 


Homo sapiens mRNA; cDNA DKFZp564M0264 (f


1.7 


other 


105743 BE246502 


Hs.9598 


sema domain, immunoglobulin domain (Ig), 


2.7 


other 


105746 AW151952 


Hs.46679 


hypothetical protein FLJ20739 


1.5 


?


105759 AI123118 


Hs.15159 


chemokine-like factor, alternatively spl 


1.3 


other 


105771 AI267720 


Hs.153221 


synovial sarcoma, translocated to X chro 


1.6 


other 


105820 AA741336 


Hs.152108 


transcriptional unit N143 


2.2


other 


105826 AA478756 


Hs.194477 


E3 ubiquitin ligase SMURF2 


1.3 


other 


105856 AI262106


Hs.12653 


ESTs 


2.4 


other 


105858 AF151066 


Hs.281428 


hypothetical protein 


2.9 


other 


105875 AK001708 


Hs.32271 


hypothetical protein FLJ10846 


1.4 


other 


105930 AF016371 


Hs.9880 


peptidyl prolyl isomerase H (cyclophilin


53 


other 


106000 AW194426 


Hs.20726


ESTs 


1.7 


other 


106011 AW081202 


Hs.12264 


Homo sapiens, clone IMAGE:2989556, mRNA,


2.8 


other 


106017 AA477956 


Hs^268 


ESTs 


1.4 


other 


106073 AL157441 


Hs.17834 


downstream neighbor of SON 


1.4 


other 


106078 AA130158 


Hs.19977 


ESTs, Moderately similar to ALU8_HUMAN A


1.6 


? 


106094 AA533491 


Hs.23317 


hypothetical protein FLJ14681


6.9 


other 


106140 AB006624 


Hs.14912 


KIAA0286 protein 


1.6 


other 


106271 AA251393 


Hs.289052


Homo sapiens, Similar to RIKEN cDNA 5430


10.8 


? 
106288 AB037742 Hs.24336 KIAA1321 protein

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromoso

106333 AL043114 Hs.22410 ESTs, Weakly similar to A64B49 collagen

105360 AK001404 Hs.194698 cyclin B2

106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2

106381 AB040916 Hs.24106 KIAA1483 protein

106389 AW748420 Hs.6236 Homo sapiens cDNA: FLJ21487 fis, clone C

106457 AF119256 Hs.27801 zinc finger protein 278 

106470 D63078 Hs.186180 Homo sapiens cDNA: FLJ23038 fis, clone L

106586 AA243837 Hs.57787 ESTs

106589 AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 fis, clone HE

106610 AA458882 Hs.79732 fibulin 1

106624 NM_003595 Hs.26350 tyrosylprotein sulfotransferase 2

106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (f

106669 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha

106713 BE614802 Hs.184352 hypothetical protein FLJ12549 

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi

106723 BE388094 Hs.21857 ESTs 

106795 AF174487 Hs.293753 Bcl-2-related ovarian killer protein-lik

106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to

106831 BE564871 Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast 

106846 AB037744 Hs.34892 KIAA1323 protein

106852 AF151031 Hs.300631 hypothetical protein

106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA,

106886 W79171 Hs.9567 GL002 protein

106906 AA861271 Hs.222024 transcription factor BMA12 

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase

106945 AK000511 Hs.6294 hypothetical protein DKFZp434L1435 simil

106973 BE156256 Hs.11923 hypothetical protein 

106978 AW631460 Hs^688 ESTs

107004 AA146872 Hs.300700 hypothetical protein FLJ20727

107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukem

107071 AW385224 Hs^5198 ectonucleotide pyrophosphatase/phosphodi

107113 AK000733 Hs.23900 GTPase activating protein

107125 AK000512 Hs.69388 hypothetical protein FLJ20505

107136 AV661958 Hs.8207 GK001 protein 

107146 AK001455 Hs.5198 Down syndrome critical region gene 2 

107151 AW378065 Hs.8687 ESTs 

107155 AW391927 Hs.7946 KIAA1288 protein

107174 BE122762 Hs.25338 ESTs

107197 W15477 Hs.64539 glioma pathogenesis-related protein 

107221 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 (

107243 BE219716 Hs^4727 ESTs, Moderately similar to 138759 zinc

107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 comp

107263 D60341 Hs.21198 translocase of outer mitochondrial membr

107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A

107299 BE277457 HsJ0661 hypothetical protein MGC4606

107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp686l0324 (f 

107354 NM_006299 Hs.96448 zinc finger protein 193 

107392 AW299900 Hs.267632 TATA element modulatory factor 1

107481 AA307703 Hs^79766 kinesin family member 4A 

107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat)

107554 AA001386 Hs.59844 ESTs 

107681 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 

107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU S

107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed

107901 L42612 Hs.335952 keratin 6B 

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme

108040 AL121031 Hs.159971 SWI/SNF related, matrix associated, acti

108230 AA054224 Hs.59847 ESTs 

108274 AF129535 Hs.272027 F-box only protein 5 

108296 N31256 Hs.161623 ESTs 

108496 AA083069 Hs.339659 ESTs 

108607 BE300380 Hs.69476 Homo sapiens cDNA FLJ12758 fis, clone NT

108621 AA101809 Hs.182685 ESTs 

108634 AW022410 Hs.69507 ESTs 




1.3 


other 


3.7 


other 


5.5 


SS,


5.8 


other 


6.4 


other 


6.6 


other 


2.2


TM 


2.7 


other 


2.3 


other 


1.6 


other 


2.4 


? 


8 


SS, 


7.8 


other 


1.8 


other 


1.3 


TM 


4.6 


other 


1.3 


other 


1.6 


SS,


5.7 


other 


16.2 


TM 


1.5 


other 


2.2 


other 


1.3 


other 


16.8 


other 


1.5 


TM 


22 


other 


3.4 


other 


6.8 


? 


6.7 


other 


6.1 


SS, 


1.3 


other 


1.8 


other 


17 


other 


2.5 


other 


17 


other 


47 


other 


2 


other 


6.4 


TM 


33.5 


other 


5.2 


? 


6.1 


other 


17.4 


other 


7.4 


? 


1.8 


other 


67 


other 


2.5 


other 


3.2 


TM 


2 


TM 


5 


? 


1.2 


other 


1.6 


other 


3 


TM 


1.4 


other 


2.3 


SS,TM 


2.2 


? 


8.4 


TM 


2.5 


other 


2.3


other 


6.8 


other 


1.6 


other 


1.3 


other 


7.2 


? 


2.6 


other 


3.6 


other 


3.5 


other 


17 


other 


1.8 


SS,TM






108647 BE546947 


Hs.44276 


homeobox C10


9.8 


other 


108695 AB029000


Hs.70823 


KIAA1077 protein


7.3 


other 


108740 AI089575 


Hs.9071 


progesterone membrane binding protein


2.8 


? 


108828 AK001693 


Hs.273344


DKFZP564O0463 protein 


1.9 


other 


108859 AL121500 


Hs.178904 


ESTs 


1.6 


TM 


108872 H06720 


Hs.111680 


endosulfine alpha 


2.2 


other 


108891 AI801235 


Hs.48480 


ESTs 


5.4 


other 


108894 AK001431 


Hs.5105 


hypothetical protein FLJ10569


4.1 


TM 


108955 AA149754 


Hs.195155 


Homo sapiens amino acid transport system


5.7 


? 


108982 AA151708 


Hs.171980 


homeo box (expressed in ES cells) 1 


1.7 


other 


108987 AA152178 


Hs.23467 


hypothetical protein FLJ10633 


6.3 


other 


109002 AB028987 


Hs.72134 


KIAA1054 protein 


1.7 


other 


109011 AA156542 


Hs.72127 


ESTs 


1.5 


other 


109026 AA157811 




gb:zo35d07.s1 Stratagene colon (937204)


5.4 


other 


109068 AA164293


Hs.72545 


ESTs 


3 


other 


109101 AW608930 


Hs.52184 


hypothetical protein FLJ20618


1.6 


SS,


109112 AW419196 


Hs.37924


hypothetical protein FLJ13782 


3.3 


TM


109124 AK0a0664 


Hs.183887 


hypothetical protein FLJ22104


1.7 


other 


109139 AJ132592 


Hs.59757 


zinc finger protein 281 


2.7 


other 


109166 AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines 


3


TM 


109198 BE566742 


Hs.58169 


highly expressed in cancer, rich in leuc


2.1 


other 


109213 NM_016603


Hs.82035 


potential nuclear protein C5ORF5; GAP-li


5.4 


other 


109220 AW958181 


Hs.189998 


ESTs 


5.8


other 


109233 AU077281 


Hs.170285 


nucleoporin 214kD (CAIN)


5.3 


other 


109270 N99673 


Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ


1.4 


other 


109273 AA375752 


Hs.82719 


Homo sapiens mRNA; cDNA DKFZp586F1822 (f 


3 


other 


109313 AF153201 


Hs.86276 


C2H2 (Kruppel-type) zinc finger protein


1.3 


other 


109341 AA213506 


Hs.115099


EST 


3 


?


109391 AL096858 


Hs.184245 


KIAA0929 protein Msx2 interacting nuclea


1.5 


other 


109420 H83603 


Hs.40408 


homeobox C9


2.2 


SS, 


109426 N30531 


Hs.42215 


protein phosphatase 1, regulatory subuni 


3.1 


TM 


109429 AI160029 


Hs.61438 


ESTs 


2 


? 


109445 AA232103 


Hs.189915 


ESTs 


1.8 


other 


109450 AB032969 


Hs.173042 


KIAA1143 protein 


3.8 


other 


109468 NM_015310


Hs.6763 


KIAA0942 protein


3.3 


other 


109478 AW074143 


Hs.87134 


ESTs 


2 


TM 


109570 L40027 


Hs.118890


glycogen synthase kinase 3 alpha 


2.1 


other 


109662 F02614 


Hs.27319 


ESTs 


1.4 


other 


109825 R71264 


Hs.16798 


ESTs 


1.3 


other 


110039 H11938


Hs.21907 


histone acetyltransferase 


2 


other 


110056 AA503041 


Hs.279009 


matrix Gla protein 


2.5


other 


110085 AA603840 


Hs.29956 


KIAA0460 protein


1.7 


other 


110110 T07353 


Hs.7948 


ESTs 


2.9 


other 


110129 R51853 


Hs.226429 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1.7 


SS, 


110154 NM_014521


Hs.17667 


SH3-domain binding protein 4 


4.3 


other 


110240 AI668594 


Hs.176588 


ESTs, Weakly similar to CP4Y_HUMAN CYTOC


4.3


? 


110242 N41744 


Hs.19978


CGI-30 protein 


1.3 


other 


110259 H28428 


Hs^2406 


ESTs, Weakly similar to 138022 hypotheti 


2.2 


other 


110312 BE256986 


Hs.11896 


hypothetical protein FLJ12089 


2.1 


other 


110501 H55748 




gb:yq94a01.s1 Soanes fetal liver spleen 


6.1


? 


110504 H55915 


Hs.210859


hypothetical protein FLJ11016


6.1 


TM 


110525 H57330 


Hs.37430


EST 


6.4 


other


110568 AK001160 


Hs.5999 


hypothetical protein FLJ10298


1.3


? 


110699 T97586 


Hs.18090 


ESTs 


1.8 


other 


110705 AB007902 


Hs.32168 


KIAA0442 protein 


1.6 


TM 


110742 AW190338 


Hs.28029 


hypothetical protein MGC11256


7.8 


other 


110761 AL138077 


Hs.16157 


hypothetical protein FLJ12707


2.5 


other 


110762 BE044245 


Hs.30011 


hypothetical protein MGC2963 


9.3 


? 


110765 AK000322 


Hs.18457 


hypothetical protein FLJ20315


5.5


SS, 


110769 BE000831 


Hs.23837 


Homo sapiens cDNA FLJ11812 fis, clone HE


2.1 


TM 


110799 AI089660 


Hs.323401 


dpy-30-like protein 


1^ 


TM 


110805 T25829 


Hs.24048 


FK506 binding protein precursor 


6.7 


TM 


110813 AA767373 


Hs.35669 


ESTs, Moderately similar to ALU1_HUMAN A


5.7 


other 


110820 R33261


Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p 


3.4 


other 


110840 N31598 


Hs.12727 


hypothetical protein FLJ21610


1.7 


TM 


110844 AI740792 


Hs.167531 


methylcrotonoyl-Coenzyme A carboxylase 2


1.7 


other 


110854 BE612992


Hs.27931 


hypothetical protein FLJ10607 similar to 


4.7 


other 
110856


AA992380




gb:ot37g06.s1 Soares_testis_NHT Homo sap


2.3 


other 


110885 


BE384447 


Hs.16034 


hypothetical protein MGC13186 


3.5 


? 


110897 


AL117430


Hs.6880 


DKFZP434D156 protein 


2.2 


? 


110915 


BE092285 


Hs.29724 


hypothetical protein FLJ13187 


2.6 


SS, 


110918 


H04360 


Hs.24283 


ESTs, Moderately similar to reduced expr


1.9 


TM 


110958 


NM_005864


Hs.24587 


signal transduction protein (SH3 contain


6.7 


other 


110963 


AK002180


Hs.11449 


DKFZP5640123 protein


2 


other 


110981 


AK001980 


Hs.24284 


ADP-ribosyltransferase (NAD+; poly(ADP-r 


1.3 


other 


110984 


AW613287 


Hs.80120 


UDP-N-acetyl-alpha-D-galactosamine:polyp


1.8 


? 


111125 


N63823 


Hs.269115 


ESTs, Moderately similar to Z195_HUMAN Z 


3.7 


other 


111132 


AB037807 


Hs.83293 


hypothetical protein 


2.1 


TM 


111164 


N46180 


Hs.122489 


Homo sapiens cDNA FLJ13289 fis, clone OV


2.3 


other 


111172 


R67419 


Hs.21851 


Homo sapiens cDNA FLJ12900 fis, clone HI


3.7 


other 


111174 


AL050166 


Hs.26295 


Homo sapiens mRNA; cDNA DKFZp586D1122 (f


7.5 


other 


111179 


AK000136


Hs.10760 


asporin (LRR class 1)


7.1 


other 


111184 


AI815486 


Hs^43901 


Homo sapiens cDNA FLJ20738 fis, clone HE


6.8 


other 


111189 


N67603 


Hs^72130 


ESTs, Weakly similar to S65824 reverse t 


3.6 


SS,


111216 


AW139408


Hs.152940 


ESTs 


1.5 


other 


111221 


AB037782 


Hs.15119


KIAA1361 protein


2.6 


other 


111223 


AA852773 


Hs.334838 


KIAA1866 protein 


47 


other 


111239 


N90956 


Hs.17230 


hypothetical protein FLJ22087


7.9 


? 


111285 


AA778711 


Hs.4310 


eukaryotic translation initiation factor 


7 


other 


111299 


AB033091 


Hs.74313 


KIAA1265 protein 


5 


other 


111312 


AI523913 


Hs.34504 


ESTs 


3.8 


other 


111318 


T99755 


Hs.334728 


ESTs 


1.2 


TM 


111337 


AA837396 


Hs.263925 


LIS1-interacting protein NUDE1, rat homo


5.1 


other 


111352 


H58589 


Hs.35156 


Homo sapiens cDNA FLJ11027 fis, clone PL


2.2 


other 


111370 


AI478658 


Hs.94631 


brefeldin A-inhibited guanine nucleotide


2.8 


? 


111384 


N94606 


Hs.288969 


HSCARG protein 


2.2 


other 


111389 


AK000987 


Hs.169111 


oxidation resistance 1


^1 


other 


111452 


R02354 


Hs.15999 


ESTs 


27 


TM


111486 


AI051194


Hs.227978 


EST 


6.6 


other 


111549 


W90638


Hs^0321 


ESTs, Moderately similar to ZRF1_HUMAN Z


1.4


other 


111585 


R10720 


Hs.20670


EST 


1.6 


? 


111627 


R52656 


Hs.21691 


ESTs 


16 


other 


111870 


AB037834 


Hs.18685 


Homo sapiens mRNA for KIAA1413 protein, 


2.4 


other 


111937 


BE298665


HS.U846 


Homo sapiens mRNA; cDNA DKFZp564D016 (fr 


10.6 


other 


111944 


AW083791 


Hs.21263 


suppressor of potassium transport defect 


6.6 


TM 


111987 


NM_015310


Hs.6763 


KIAA0942 protein


5.1 


other 


112134 


R41823 


Hs.7413


ESTs; calsyntenin-2 


2.8 


other 


112244 


AB029000 


Hs.70823


KIAA1077 protein


14.6 


other 


112388 


R46071 


Hs.301693 


Homo sapiens, clone IMAGE:3638994, mRNA,


9 


other 


112456 


NM_016248


Hs.232076 


A kinase (PRKA) anchor protein 11 


1.4 


other 


112464 


AW007287 


Hs.28538 


Homo sapiens cDNA: FLJ21086 fis, clone C


1.4 


TM 


112506 


AI742756 


Hs.26079 


ESTs 


3.2 


other 


112513 


R68425 


Hs.13809 


hypothetical protein FLJ10648


2 


TM 


112752 


AK001635 


Hs.14838 


hypothetical protein FLJ10773


1.8 


other 


112884 


AK000004 


Hs.5013 


Homo sapiens mRNA for FLJ00004 protein,


6.6 


other 


112923 


T10258 


Hs.5037 


EST 


1.5 


? 


112936 


AW970826 


Hs.6185 


KIAA1557 protein 


3.2 


other 


112958 


R61388 


Hs.6724 


ESTs 


6.1 


other 


112966 


Z44718 


Hs.102548 


glucocorticoid receptor DNA binding fact


6.5 


other 


112978 


AK000272 


Hs.7099 


hypothetical protein FLJ20265 


1.2 


other 


112995 


AA737033 


Hs.7155


ESTs, Moderately similar to 2115357A TYK


5.6 


other 


112996 


BE276112 


Hs.7165


zinc finger protein 259 


2 


other 


113047 


AI571940


Hs.7549


ESTs 


1.9 


other 


113049 


AW965190 


Hs.7560


Homo sapiens mRNA for KIAA1729 protein,


2.4 


TM 


113089 


T40707 


Hs.270862 


ESTs 


1.3 


SS, 


113196 


T57317 




gb:yb51a03.s1 Stratagene fetal spleen (9 


17 


other 


113248 


T63857 




gb:yc16e01^1 Stratagene lung (937210) H


2.8 


other 


113254 


AK002180 


Hs.11449


DKFZP5640123 protein 


1.3 


other 


113277 


AW971049 


Hs.11774


protein (peptidyl-prolyl cis/trans isome


3.2 


other 


113429 


AA688021 


Hs.179808 


ESTs 


1.2 


other 


113499 


AI467906 


Hs.8882


ESTs 


6 


other 


113547 


H59588 


Hs.15233 


ESTs 


2 


SS, 


113647 


AA813887 


Hs.188173 


Homo sapiens cDNA FLJ12187 fis, clone MA


1.3 


SS, 


113702 


T97307 




gb:ye53h05.s1 Soares fetal liver spleen


4.4 


other 
113759


AW499600


Hs.9456 


SWI/SNF related, matrix associated, act




other 


113777 


BE266947 



Hs.10590 


zinc finger protein 313 


13.4 


other 


113783 


AL359500 


Hs.7041


hypothetical protein DKPZp/o^KczD




other 


113791 


AI269096


Hs.135578


chitobiase, di-N-acetyl- 


1.3 


other 



113808 


W44735 



Hs.9286 


Homo sapiens cDNA: FLJ2127o fis, clone C


3.3 


other 


113811 


BE207480 


Hs.6994


Homo sapiens cDNA: FLJ22044 fis, clone H


3.1 


other 


113817 


H13325 


Hs.332795 


hypothetical protein DKFZp761 017121 


3.2 


other 


113826 


AW378212 


Hs.24809 


hypothetical protein FLJ10826 


2.3 


? 


113834 


T26483 


Hs.6059 


EGF-containing fibulin-like extracellula


1U 


TM


113868 


W57902 


Hs.90744 


proteasome (prosome, macropain) 26S subu 


2.7 


other 


113870 


AL079314 


Hs.16537 


hypothetical protein, similar to (U06844 


6.1 


other 


113885 


AW959486 


Hs.21732 


ESTs 


6.6 


other 


113923 


AW953484 


Hs.3849 


hypothetical protein FLJ22041 similar to


1.9 


? 


113989 


W87544 


Hs.268828 


ESTs 


1.2


other 


114022 


AI539519


Hs.120969 


Homo sapiens cDNA FLJ11562 fis, clone HE


5.4 


other 


114030 


AI825386


Hs.164478 


hypothetical protein FLJ21939 similar to


9.4 


other 


114060 


AB029551 


Hs.7910 


RING1 and YY1 binding protein


1.8 


other 


114196 


AF017445


Hs.150926 


fucose-1-phosphate guanylyltransferase


1.5 


other 


114226 


AB028968 


Hs.7989 


KIAA1045 protein 


1.8 


other 


114253 


BE149866 


Hs.14831 


Homo sapiens, Similar to zinc finger pro 


2.3 


other 


114262 


AL117518


Hs.3686 


KIAA0978 protein


1.4 


m 


114275 


AW515443.comp 


Hs.306117 


KIAA0306 protein 15.6


114292 


AI815395 


Hs.184641 


fatty acid desaturase 2 


1.9 


TM 


114309 


AA332453 


Hs.20824 


CGI-85 protein 


2.4 


other 


114392 


AA249590 


Hs.100748 


ESTs, Weakly similar to A28996 proline-r 


1.9 


other 


114407 


BE539976 


Hs.103305


Homo sapiens mRNA; cDNA DKFZp434B0425 (f 




TM 


114455 


H37908


Hs.271616 


ESTs, Weakly similar to ALU8_HUMAN ALU S


5.6 


other 


114463 


AL120247 


Hs.40109 


KIAA0872 protein 


5.3 


TM 


114464 


AI091713 


Hs.106597 


Homo sapiens, Similar to RIKEN cDNA 1110


1.3 


other 


114471 


AA028074 


Hs.104613 


RP42 homolog


1.9 


? 


114480 


BE066778 


Hs.151678


UDP-N-acetyl-alpha-D-galactosamine:polyp


13.4 


other 


114671 


AA766288


Hs.266273 


hypothetical protein FLJ13346 


2 


other 


114698 


AA476966 


Hs.110857


polymerase (RNA) III (DNA directed) poly


3.6 


other 


114730 


AI373544 


Hs.331328 


intermediate filament protein syncoilin


3.9 


other 


114767 


AI859865


Hs.154443


minichromosome maintenance deficient (S. 


1.7 


other 


114774 


AV656017 


Hs.184325 


CGI-76 protein 


3.2


other 


114798 


AA159181 


Hs.54900 


serologically defined colon cancer antig 


3.6 


other 


114860 


AL157545


Hs.42179 


bromodomain and PHD finger containing, 3 


4.4 


other 


114895 


AA236177 


Hs.76591 


KIAA0887 protein


7.2 


other 


114896 


BE539101 


Hs.5324 


hypothetical protein 


1.3 


other 


114911 


AA236672 




gb:zt291023l Soares ovary tumor NbHOT H


1.5 


other 


114930 


AA237022 


Hs.188717 


ESTs 


2 


SS,


114938 


AA242834 


Hs.58384 


ESTs 


2.9 


other 


114965 


AI733881


Hs.72472 


BNP-R1B 


2.3 


? 


115023 


AF102546 


Hs.63931 


dachshund (Drosophila) homolog


1.3 


other 


115038 


AA252360 


Hs.87968 


toll-like receptor 9 


1.6 


other 


115061 


AI751438 


Hs.41271 


Homo sapiens mRNA full length insert cDN 


11.8 


other 


115117 


AI670847 


Hs.5324 


hypothetical protein 


1.5 


other 


115206 


AW183695


Hs.186572


ESTs 


2.5 


other 


115221 


AW365434 


Hs.79741 


hypothetical protein FLJ10116 


1.5 


other 


115239 


BE251328 


Hs.73291 


hypothetical protein FLJ10881 


1.3 


TM 


115242 


AI368236


Hs.283732 


ESTs, Moderately similar to ALU1_HUMAN A


1.4 


other 


115278 


AK002163 


Hs.301724 


hypothetical protein FLJ11301


1.5 


other 


115285 


AW972872 


Hs.293736 


ESTs 


2.4 


other 


115291 


BE545072 


Hs.122579 


hypothetical protein FLJ10461


6.3 


SS,


115400 


AI215069


Hs.89113


ESTs 


6.7 


? 


115468 


AA314349 


Hs.46499 


tumor antigen SLP-8p 


7.5 


?


115471 


AK001376 


Hs.59346 


hypothetical protein FLJ10514


1.4 


TM 


115479 


AW301606 


Hs.278188 


ESTs, Moderately similar to I54374 gene


4.1 


TM 


115496 


AW247593 


Hs.71819 


eukaryotic translation initiation factor


16.3 


other 


115500 


Y14443 


Hs.88219 


zinc finger protein 200 


5 


other 


115553 


AJ275986 


Hs.71414 


transcription factor (SMIF gene)


2.5 


other 


115581 


AI540842 


Hs.61082 


ESTs 


6.2 


other 


115587 


BE081342 


Hs.283037 


HSPC039 protein 


2.9 


other 


115590 


AA399477 


Hs.67896 


7-60 protein 


5.3 


TM 


115646 


N36110 


Hs.305971 


solute carrier family 2 (facilitated glu


4.8 


? 


115652 


BB)93589 


Hs.38178 


hypothetical protein FLJ23468


10.6 


other 



other 






116655 AL048269 


Hs.288544 


Homo sapiens, clone MGC:16063, mRNA, com


12.7 


TM 


115663 AI138785 


Hs.40507 


ESTs 


2 


other 


115676 AA953006 


Hs.88143 


ESTs 


3.1 


other 


115690 AA625132 


Hs.44159 


hypothetical protein FLJ21615


1.7 


TM 


115693 AF231023 


Hs.55173


cadherin, EGF LAG seven-pass G-type rece


65 


other


115715 BE395161 


Hs.1390 


proteasome (prosome, macropain) subunit,


1.7 


other 


115734 AI950339


Hs.40782 


ESTs 


2.7 


TM 


115811 NM_015434


Hs.48604 


DKFZP434B168 protein 


2.1 


other 


115823 AI732742 


Hs.87440 


ESTs 


2.1 


other 


115837 AI675217


Hs.42761 


ESTs 


1.3 


other 


115844 AI373062


Hs.332938 


hypothetical protein MGC5370 


4.4 


other 


115866 AW062629 


Hs.52081 


KIAA0867 protein


7^ 


other 


115675 N55669 


Hs.333823


mitochondrial ribosomal protein L13


1.2 


other 


115941 AI867451


Hs.46679 


hypothetical protein FLJ20739


5.5 


other 


115968 AB037753 


Hs.62767 


KIAA1332 protein


9.8 


other 


116003 BE275469 


Hs.66493 


Down syndrome critical region gene 5 


1.4 


other 


116011 AL359053 


Hs.57664 


Homo sapiens mRNA full length insert cDN 


2.4 


other 


116108 AA770688 


Hs.28777 


H2A histone family, member L 


1.8 


other 


116134 BE243834 


Hs.50441 


CGI-04 protein


1.4 


other 


116189 N35719 


Hs.44749 


ESTs, Moderately similar to T00358 hypot 


1.2 


other 


116195 AW821113 


Hs.72402 


ESTs 


2.1 


other 


116238 AV660717


Hs.47144 


DKFZP586N0819 protein 


1.7 


other 


116246 AF265555 


Hs.250646 


baculoviral IAP repeat-containing 6


1.7 


other 


116262 AI936442 


Hs.59838 


hypothetical protein FLJ10808


1.8 


? 


116298 AI955411 


Hs.94109 


Homo sapiens cDNA FLJ13634 fis, clone PL


1.9 


other 


116318 AF097645 


Hs.58570 


deleted in cancer 1; RNA helicase HDB/DI


5 


SS, 


116325 AI472106 


Hs.49303 


Homo sapiens cDNA FLJ11663 fis, clone HE


14 


SS,


116336 AL133033 


Hs.4084 


KIAA1025 protein 


1.9 


? 


116339 AK000290 


Hs.44033 


dipeptidyl peptidase 8 


15 


other 


116350 AA497129 


Hs.184771 


nuclear factor I/C (CCAAT-binding transc


1.9 


? 


116358 AI149586


Hs.38125 


interferon-induced protein 75, 52kD


1.9 


? 


116365 N50174 


Hs.46765 


ESTs 


6.1 


other 


116368 N9a466 


Hs.71109 


KIAA1229 protein 


1.6 


? 


116417 AW499664 


Hs.12484 


Human clone 23826 mRNA sequence


7.4 


other 


116436 AA161411 


Hs.58668 


chromosome 21 open reading frame 57 


2.1 


other 


116462 AF218313 


Hs.236828


putative helicase RUVBL 


1.5 


TM 


116470 AI272141


Hs.83484 


SRY (sex determining region Y)-box 4


2.1 


TM 


116575 AA312572 


Hs.6241 


phosphoinositide-3-kinase, regulatory su


1.5


other 


116637 AK001043 


Hs.92033 


integrin-linked kinase-associated serine


2.7 


other 


116640 X89984 


Hs.211563 


B-cell CLL/lymphoma 7A


2.3 


other 


116700 AI800202 


Hs.317589 


hypothetical protein MGC10765 


1.4 


other 


116705 AW074819 


Hs.12313 


hypothetical protein FLJ14568


3.4 


other 


116732 AW152225


Hs.165909 


ESTs, Weakly similar to I38022 hypotheti


2.9 


other 


116926 H73608 


Hs.290830 


ESTs 


1.7 


TM 


117034 U72209 


Hs.180324


YY1-associated factor 2


3.4 


TM 


117132 AI393666 


Hs.42315 


p10-binding protein


5.2 


? 


117247 N21032 




gb:yx46f06^1 Soares melanocyte 2NbHM Ho


5.5 


TM 


117276 N71183 


Hs.121806 


Homo sapiens cDNA FLJ11971 fis, clone HE


1.5 


TM 


117284 AK001701 


Hs.183779 


Homo sapiens cDNA FLJ10590 fis, clone NT


2 


other 


117367 AI041793


Hs.42502 


ESTs 


2 


other 


117368 AI878942 


Hs.90336 


ATPase, H+ transporting, lysosomal (vacu 


2.1 


? 


117382 AF150275


rls.401 id 


ESTs


9 7 


TM



117412 N32536 


Hs.42645 


solute carrier family 16 (monocarboxylic


1.4 


other 


117557 AF123050 


Hs.44532 


diubiquitin


3.4 


TM 


117588 N34895 


Hs.44648 


ESTs 


3.4 


? 


117745 BE294925 


Hs.46680 


CGI-12 protein


3 


SS,


117754 AA121673 


Hs.59757 


zinc finger protein 281 


1.9 


other 


117879 N54706 


Hs.303025 


chromosome 11 open reading frame 24 


1.8 


other 


117904 BE540675 


Hs.332938 


hypothetical protein MGC5370 


6 


? 


117911 AL137379 


Hs.47125 


hypothetical protein FLJ13912


1.7 


other 


117933 Y10518 


Hs.116470 


hypothetical protein FLJ20048 


1.7 


other 


117983 AL110246


Hs.47367 


KIAA1785 protein 


5.4 


other 


118078 N54321 


Hs.47790 


EST 


5.2 


other 


118301 AA453902 


Hs.293264 


ESTs 


2.6 


other 


118429 AA243332 


Hs.74649 


cytochrome c oxidase subunit VIc


2.5 


TM 


118472 AL157545 


Hs.42179 


bromodomain and PHD finger containing, 3


4.1 


other 


118488 AJ277275 


Hs.50102 


rapa-2 (rapa gene)


1.2 


other 






118509 
118528 
118656 
118670 
118698 
118737 
118925 
118984 
118986 
119206 
119235 
119265 
119279 
119298 
119338 
119403 
119478 
119486 
119513 
119601 
119602 
119676 
119682 
119774 
119780 
119789 
119805 
119818 
119863 
119905 
119986 
120132 
120206 
120248 
120269 
120274 
120280 
120296 
120297 
120324 
120325 
120327 
120336 
120342 
120345 
120349 
120352 
120356 
120371 
120382 
120383 
120386 
120388 
120389 
120396 
120404 
120418 
120423 
120472 
120473 
120484 
120504 
120509 
120520 
120535 
120551 
120570 



N22617 

AI949952 

AI458020

AA332845 

AB033113 

AA199686 

N92293 

AI668709 

AF148713

W24781 

AW453069 

BE539706 

N57568 

NM_001241

AI417240 

AL117554

AI624342 

AI796730

W37933 

AK000155 

AW675298 

AA243837 

W61019 

AB032977 

NM_016625

BE393948 

AJ223810 

AA130970 

AA081218 

AW449064 

AA703129 

W57554 

H26735 

AI924294 

AW131940 

AA177051 

AA190577 

AW995911 

AA191384 

AA195517 

AA195651 

AK000292 

N85785 

AW450669 

AA210722 

AW969481 



AF000545 

AA219305 

AA228026 

AL109963

AW969665 

AA232874 

AW967985 

AA134006 

AB023230 

AW966893

AA236453 

AI950087 

AA251973 

AA253170 

AA256837 

BE047718

AA258601 

BE350244 

AA279160 

AA280679 



Hs.43228 

Hs.49397 
Hs.293287 
Hs.152618
Hs.50187 

Hs.206832 

Hs.240722 

Hs.125830 

Hs.293798 

Hs.3657

Hs.285363 

Hs.48028 

Hs.155478 

Hs.320836 

Hs.119908

Hs.170042 

Hs.55513 

Hs^1664 

Hs.233694 

Hs.57787 

Hs.57811 

Hs.6298

Hs.191381 

Hs.50915 

Hs.43213 

Hs.58382 

Hs.58608 

Hs.119571

Hs.58963 

Hs.125019

Hs.91668 

Hs.173259 

Hs.104030 



Hs.299883 

Hs.104072 

Hs.191643 

Hs.104105 

Hs.278732 

Hs.181165 

Hs.45068 

Hs.104158 

Hs.55169 

Hs.193172 

Hs.296433 

Hs.104196 

Hs.38774 

Hs.123122 

Hs.154848

Hs.104245 

Hs.325572 

Hs.79306 

Hs.96427 

Hs.26613 

Hs.18978 

Hs.269988
Hs.96473 

Hs.96545 

Hs.161731 

Hs.96547 

Hs.111407 

Hs.271445 



Homo sapiens cDNA FLJ11835 fis, clone HE

ESTs

ESTs

ESTs, Moderately similar to ZN91_HUMAN Z
KIAA1287 protein

gb:zq75g09.r1 Stratagene hNT neuron (937
ESTs, Moderately similar to ALU8_HUMAN A
ESTs, Moderately similar to ALU8_HUMAN A
bladder cancer overexpressed protein
KIAA1710 protein

activity-dependent neuroprotective prote

ESTs

EST

cyclin T2

ESTs, Weakly similar to A47582 B-cell gr
nucleolar protein NOP5/NOP58
ESTs
ESTs

Empirically selected from AFFX single pr

Homo sapiens mRNA; cDNA DKFZp667l103 (fr

hypothetical protein FLJ11350

ESTs

ESTs

KIAA1151 protein
hypothetical protein

kallikrein 5 (KLK5; KLK-L2; stratum corn

ESTs, Weakly similar to IEFS_HUMAN TRANS

hypothetical protein FLJ11101

Homo sapiens cDNA FLJ14206 fis, clone NT

collagen, type III, alpha 1 (Ehlers-Danl

ESTs

lymphoid nuclear protein (LAF-4) mRNA
Homo sapiens clone PP1498 unknown mRNA
uncharacterized bone marrow protein BM03
ESTs

gb:hc02a02.s1 NCI_CGAP_Pr3 Homo sapiens

gb:zp52g02.s1 Stratagene HeLa cell s3 93

hypothetical protein FLJ23399

ESTs, Weakly similar to Z195_HUMAN ZINC

ESTs

ESTs

hypothetical protein FLJ20285
eukaryotic translation elongation factor
hypothetical protein DKFZp434i143
ESTs

hypothetical protein

ESTs, Weakly similar to I38022 hypotheti
putative purinergic receptor

EST
ESTs

FSH primary response (LRPR1, rat) homolo
hypothetical protein DKFZp434D0127

ESTs

ESTs, Moderately similar to ALU7_HUMAN A
eukaryotic translation initiation factor
KIAA1013 protein

Homo sapiens mRNA; cDNA DKFZp586F1323 (f
Homo sapiens cDNA: FLJ22822 fis, clone K
gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien
ESTs
EST

gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapi

ESTs

EST

ESTs

Homo sapiens, clone IMAGE:3613029, mRNA,
ESTs, Weakly similar to ALU1_HUMAN ALU S
1.5


other 


7.4 


? 


2.5 


other 


1.2 


TM 


2.1 


TM 


5.2 


other 


1.4 


other 


3.6 


other 


4.9 


? 


1.7


TM 


2.2 


other 


1.4 


? 


25.1 


other 


1.6 


? 


1.3


other 


6.7 


TM 


2.4 


other 


2.1 


other 


1.9 


other 


3.7 


TM 


3 


other 


1.4 


other 


1.2 


? 


1.8 


TM 


3.1 


other 


9.2 


other 


3.6 


TM 


2.5 


? 


2.7 


TM 


2.6 


other 


27 


other 


1.2 


other 


45.7 


other 


12 


other 


9.6 


other 


4.7 


other 


2.1 


other 


1.9 


TM 


15.2 


other 


5.6 


? 


6.5 


other 


16.1 


other 


3 


other 


5.6 


other 


4.6 


SS,TM 


16.8 


other 


5.1 


other 


28.1


TM 


12.4 


? 


4.1 


TM 


9.7 


TM 


32.6


other 


3.2 


other 


217 


other 


12.5 


other 


7.3 


other 


11.4 


other 


1.9 


other 


19.4 


other 


5.5 


? 


10.4 


? 


4 


? 


9.4 


other 


2.4 


other 


2.5 


? 


5.3 


other 


14.4 


? 
120582 BE244830 Hs.284228 ZNF135-like protein

120590 AW372799 Hs.125790 leucine-rich repeat-containing 2

120596 kAimiA Hs.237323 N-acetylglucosamine-phosphate mutase

120619 AW965339 Hs.111471 ESTs

120624 AW407987 Hs.173518 M-phase phosphoprotein homolog

120639 AA286942 gb:2s56K)5^1 NCI_CGAP_GCB1 Homo sapiens

120848 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA,

120653 AW063659 Hs.191649 ESTs

120668 AW969638 Hs.112318 6.2 kd protein

120669 BE536739 Hs.109909 ESTs

120695 AA976503 gb:oq30a045l NCI_CGAP_GC4 Homo sapiens

120698 AI821539 Hs.97249 ESTs

120713 AW449855 Hs.96557 Homo sapiens cDNA FLJ12727 fis, clone NT

120718 AA292747 Hs.97298 ESTs

120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B c

120774 AI608909 Hs.193985 ESTs

120807 AA346385 Hs3mQ2 SH3-containing protein SH3GLB2; KIAA1848

120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens

120938 AA385260 Hs.104632 EST

120977 AA398155 Hs.97600 ESTs

120984 BE262951 Hs.99052 ESTs

120985 AI219896 Hs.97592 ESTs
121011 AA398360 Hs.97608 EST
121026 AI439713 Hs.165295 ESTs

121081 AA398721 Hs.186749 ESTs, Highly similar to I37550 mismatch

121133 AA363307 Hs.97032 ESTs

121176 AL121523 Hs.97774 ESTs

121223 AI002110 Hs.97169 ESTs, Weakly similar to dJ867H12.2.1 [H.

121320 AA403008 Hs.301927 c6.1A

121340 AW956981 Hs.97910 Homo sapiens cDNA FLJ13383 fis, clone PL

121408 AA406137 Hs.98019 EST

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell gr

121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, com

121452 AW971063 Hs.292882 ESTs

121455 H58306 Hs.15165 retinoic acid induced 14

121457 W07404 Hs.144502 hypothetical protein FLJ22055

121496 AA442224 Hs.97900 ESTs

121505 AA494172 Hs.194417 ESTs

121508 AA402515 Hs.97887 ESTs

121513 AA416653 Hs.181510 ESTs

121514 AA412112 gb:zt69b02^1 Soares_testis_NHT Homo sap
121549 AA412477 Hs.98142 EST

121558 AA412497 gb:zt95g12^1 Soares_testis_NHT Homo sap

121577 AA411970 Hs.98096 EST

121581 AA416568 gb:zu05c10^1 Soares_testis_NHT Homo sap

121589 AD001528 Hs.89718 spermine synthase
121594 AA626010 Hs.98247 ESTs

121622 AA416931 Hs.126065 ESTs

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (f

121682 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PL

121590 AV660305 Hs.110286 ESTs

121706 U55184 Hs.154145 hypothetical protein FLJ11585

121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ11953 fis, clone HE

121729 AI949597 Hs.98325 ESTs

121731 AA421041 Hs.180744 ESTs

121744 AA398784 Hs.97514 ESTs

121748 BE536911 Hs.234545 hypothetical protein NUF2R

121773 AB033022 Hs.158654 KIAA1196 protein

121775 AA421773 Hs.161008 ESTs

121776 AA292579 Hs.125133 hypothetical protein FLJ22501
121786 AI810774 Hs.98376 ESTs

121832 AW340797 Hs.98434 ESTs

121836 AA328348 Hs.218289 ESTs

121839 AA425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein

121842 AF027406 Hs.104865 serine/threonine kinase 23

121847 AA446628 Hs.2799 cartilage linking protein 1

121871 AW972668 Hs.293044 ESTs
121882


AA426376 


Hs.98459 


ESTs 


121911 


AA427950 




gb:zw50f02.s1 Soares_total_fetus_Nb2HF8_


121915 


AA428179 


Hs.223405 


ESTs, Moderately similar to A46010 X-lin 


121935 


AA428647 


Hs.98611 


EST 


121983 


AA298760 


Hs.180191 


hypothetical protein FLJ14904 


121985 


AI862570 


Hs.299214 


Homo sapiens, clone IMAGE:2822295, mRNA,


121995 


AA210863 


Hs.3532 


nemo-like kinase


121999 


AA430211 


Hs.98668


EST 


122009 


AW292763 


Hs.160822 


Homo sapiens cDNA: FLJ20863 fis, clone A


122013 


AA431085 


Hs.98706 


ESTs 


122036 


W92142 


Hs.271963 


ESTs, Weakly similar to ALU5_HUMAN ALU S


122050 


AI453076 


Hs.166109 


ELAV (embryonic lethal, abnormal vision,


122060 


AA431738 


Hs.98750 


EST 


122114 


AW161023 


Hs.104921 


ESTs 


122186 


AA398838 




gb:zt80d01.r1 Soares_testis_NHT Homo sap


122204 


AA435936


Hs.98842 


EST 


122246 


AA329550 


Hs.29417 


HCF-binding transcription factor Zhangfe


122257 


AA436819 


Hs.98899 


ESTs 


122302 


AA441801 


Hs.104947


ESTs 


122341 


AW601969 


Hs.99010 


hypothetical protein FLJ22263 similar to


122356 


AA443794 


Hs.98390 


ESTs 


122369 


AA443985 


Hs.303222 


ESTs 


122371 


AA868555 


Hs.178222 


ESTs 


122372 


AA446008 


Hs.336677 


EST 


122378 


AB032948 


Hs.21356 


hypothetical protein DKFZp762K2015


122405 


AA446572 


Hs.303223 


EST 


122412 


AA446869 


Hs.119316


ESTs 


122415 


AA446918 


Hs.99088 


EST 


122418 


AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K


122440 


AW505139 


Hs.9460 


Homo sapiens mRNA; cDNA DKFZp547C244 (fr 


122446 


AA447603 


Hs.99123 


EST 


122446 


AA447626 


Hs.99127


EST 


122458 


AI286159 


Hs.104980 


ESTs 


122460 


AW418788 


Hs.99148 


ESTs, Weakly similar to S43569 R01 H10.6 


122464 


AA448158 


Hs.99152 


EST 


122490 


AA448349 


Hs.238151 


EST 


122492 


AA448417 


Hs.104990 


ESTs 


122502 


AA204969 


Hs.234863 


Homo sapiens cDNA FLJ12082 fis, clone HE


122510 


AA449232 


Hs.99195 


ESTs 


122530 


AW959741 


Hs.40368 


adaptor-related protein complex 1, sigma


122547 


AA779725 


Hs.164589 


ESTs 


122555 


AA194055 


Hs.293858 


ESTs 


122570 


AA452578 


Hs.262907


ESTs 


122572 


AA452601 


Hs.99287 


EST 


122586 


AK001910


Hs.99303


Homo sapiens cDNA FLJ11048 fis, clone PL


122587 


AB040893 


Hs.6968 


KIAA1460 protein 


122598 


AI028173 


Hs.99329 


ESTs 


122599 


AL355841 


Hs.99330 


hypothetical protein FLJ23588


122602 


AA411925 


Hs.301960


ESTs 


122607 


AA453518 


Hs.98023 


ESTs 


122614 


AA453630 


Hs.99339 


EST 


122616 


AA453638 


Hs.161873


ESTs 


122617 


AI681535 


Hs.148135


serine/threonine kinase 33


122618 


AA453641 




gb:zx48e065l Soares_testis_NHT Homo sap


122622 


AA453987 


Hs.144802 


ESTs 


122717 


AA456859 


Hs.178358 


ESTs 


122762 


AI376875 


Hs.105119 


ESTs 


122829 


AW204530 


Hs.99500 


ESTs 


122834 


AA461492 


Hs.99545 


Homo sapiens cDNA FLJ10658 fis, clone NT


122836 


AA460581 


Hs.290998 


ESTs 


122837 


AA461509 


Hs.293565 


ESTs, Weakly similar to putative p150 [H 


122838 


AA460584 


Hs.334386 


ESTs 


122854 


AA600235 


Hs.9625 


NIMA (never in mitosis gene a)-related k 


122856 


AI929374


Hs.75367


Src-like-adapter


122861 


AA335721 


Hs.119394


ESTs 


122866 


BE539656 


Hs.283705 


ESTs 


122668 


AF005216 


Hs.115541 


Janus kinase 2 (a protein tyrosine kinas 
122870 AW576312
122872 AW081394
122879 AA769410
122907 AA470074
122916 AA470140
122981 AA478951
123013 AW968324
123016 AW338067
123034 AL359571
123072 AI382600
123082 AA485360
123088 AI343652
123110 AA486256
123114 BE304942

123131 T52027

123132 AI061582
123136 AW451999
123149 AI734179
123152 AW601773
123258 AA490929
123315 AA496369
123369 AA504757
123394 AA731404
123433 AW450922
123466 AA599042

123470 AW303285

123471 AB021644
123475 BE439553
123482 N95059
123486 BE019072
123508 AW380388
123615 AA609170
123619 AA602964
123658 AA609364
123674 AI269609
123735 NM_013241
123738 AA609891
123753 AA609955
123804 AA620464
123811 AA620585
123951 AB012922
123983 AJ272267
124001 L42542
124006 AI147155
124070 AI950314
124074 H05635
124178 BE463721
124203 AA372796
124352 AA640891
124375 D87454
124385 AI267847

124390 AA317338

124391 AF155099
124417 N34059
124428 H13540
124440 AA532519
124466 R10064
124482 N53935
124498 H79433
124515 AA669097
124608 N71076
124631 NM_014053
124634 AI765123
124637 AA160474
124642 AW968856
124649 N92593
124661 R48170



Hs.318722 

Hs.97103 

Hs.128654

Hs.169896 

Hs.229170

Hs.105629 

Hs.17384 

Hs.323231 

Hs.44054 

Hs.104308 

Hs.105661 

Hs.105667

Hs.193510 

Hs.265848 

Hs.271795 
131511 AA732153 Hs.27865 Homo sapiens cDNA: FLJ21333 fis, clone C

131528 AU076408 Hs.28309 UDP-glucose dehydrogenase

131532 BE268278 Hs.28393 hypothetical protein MGC2592

131543 AW966881 Hs.41639 programmed cell death 2

131544 AU55715 Hs.28555 programmed cell death 9 (PDCD9)

131562 NM_003512 Hs.28777 H2A histone family, member L

131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL

131569 AL389951 Hs.271623 nucleoporin 50kD

131618 BE393822 Hs.29645 Homo sapiens mRNA; cDNA DKFZp761C029 (fr

131622 R78195 Hs.29692 Homo sapiens cDNA FLJ11436 fis, clone HE

131623 AB037791 Hs.29716 hypothetical protein FLJ10980

131643 AW410601 Hs.30026 HSPC182 protein

131653 AW960597 Hs.30164 ESTs

131656 AI218918 Hs.30209 KIAA0854 protein

131669 X52486 Hs.3041 uracil-DNA glycosylase 2

131692 BE559681 Hs.30736 KIAA0124 protein

131714 AA642831 Hs.31016 putative DNA binding protein

131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransf

131737 AK001641 Hs.31323 inhibitor of kappa light polypeptide gen

131763 AI878932 Hs.317 topoisomerase (DNA) I

131772 AA382590 Hs.170980 KIAA0948 protein

131787 D87077 Hs.196275 KIAA0240 protein

131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT

131795 BE501849 Hs.32317 high-mobility group 20B

131798 X86098 Hs^01449 adenovirus 5 E1A binding protein

131817 U20536 Hs^80 caspase 6, apoptosis-related cysteine pr

131824 U28838 Hs^2935 TATA box binding protein (TBP)-associate

131850 AI251317 Hs.33184 ESTs

131878 AA083764 Hs.6101 hypothetical protein MGC3178

131885 BE502341 Hs.3402 ESTs

131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com

131904 AF078866 Hs.284296 Homo sapiens cDNA: FLJ22993 fis, clone K

131905 AA179298 Hs.3439 stomatin-like 2

131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso

131916 AA025976 Hs.34569 ESTs

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y

131929 BE541211 Hs.34804 Homo sapiens cDNA FLJ11472 fis, clone HE

131941 BE252983 Hs.35086 ubiquitin specific protease 1

131950 AA355113 Hs.35380 x001 protein

131962 AK000046 Hs.267448 hypothetical protein FLJ20039

131965 W79283 Hs.35962 ESTs

131971 BE567100 Hs.154936 hypothetical protein MDS025

131977 U90441 Hs.3622 procollagen-proline, 2-oxoglutarate 4-

131985 AA503020 Hs.36563 hypothetical protein FLJ22418

131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1

132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, p

132062 BE266155 Hs.3832 clathrin-associated protein AP47

132084 NM_002267 Hs.3886 karyopherin alpha 3 (importin alpha 4)

132103 BE171921 Hs.3991 ESTs

132105 AV646076 Hs.39959 ESTs

132116 AW960474 Hs.40289 ESTs

132176 AA857025 Hs.8878 kinesin-like 1

132180 NM_004460 Hs.418 fibroblast activation protein, alpha

132194 R42432 Hs.4212 ESTs

132207 BE206939 Hs.42287 E2F transcription factor 6

132235 AV658411 Hs.42656 KIAA1681 protein

132252 AI566004 Hs.141269 Homo sapiens cDNA: FLJ21550 fis, clone C

132266 AA301228 Hs.43299 hypothetical protein FLJ12890

132273 AA227710 Hs.43658 DKFZP586L151 protein

132276 AA653507 Hs.285711 hypothetical protein FLJ13089

132288 N36110 Hs.305971 solute carrier family 2 (facilitated glu

132294 AB023191 Hs.44131 KIAA0974 protein

132298 NM_015986 Hs.7120 cytokine receptor-like molecule 9

132299 AW405882 Hs.44205 cortistatin

132325 N37065 Hs.44856 hypothetical protein FLJ12116

132370 AW572805 Hs.46645 ESTs

132374 AF155582 Hs.46744 core1 UDP-galactose:N-acetylgalactosamin
AI279892

AA312135 

AL135094

AA100012 

AW973521 

AB011084 

AI224456

X16660 

AW885606 

AA306105 

AA454132 

BE386673 

BE568452 

AW631437 

AK001484

AA345547 

H12751 

BE262677 

AI796870

AB018319 

AW191962 

F11875 

NM_004600 

AI142265 

AI189075 

AA010233 

AA125965

Y10275 
AI026701

U07418 

AB007944 

BE313625 
N27852
AA808177

AF198620 

H94227 

Z11695 
Hs.46967

Hs.47334 

Hs.48827

Hs.247324 

Hs.48924 

Hs.4934 

Hs.119007

Hs.5064 

Hs.50785 

Hs.5080 

Hs.5066 

Hs.5101 

Hs.5184 

Hs^298 

Hs.53263 

Hs.5327 

Hs.283558

Hs.54277 

Hs.5460 

Hs.249239 

Hs.5534 

Hs.554 

Hs.55498 

Hs.301872 

Hs^21 

Hs.56145

Hs.56407

Hs.295901

Hs.56845

Hs.5716 

Hs.57301 

Hs.5737 

Hs.57435 

Hs.57475 

Hs.57553 

Hs.169610 

Hs.57783 

Hs^79771 

Hs.5811 

Hs.287912 

Hs.203961

Hs.585dd 

Hs.58617

Hs.59271 

Hs.59545 

Hs.59838 

Hs.167578 

Hs.60257 

Hs.127243 

Hs.197751 

Hs.61426 

Hs.6153 

Hs.288924 

Hs.323277 

Hs.301404 

Hs.62016

Hs.279905 

Hs.62711

Hs.246315 

Hs.64056 

Hs.6430 

Hs.64691 

Hs.65228 

Hs.65648 

Hs.6592 

Hs.324473



sorting nexin 14
HSPC034 protein
hypothetical protein FLJ14495
hypothetical protein FLJ12085
mitochondrial ribosomal protein S14
KIAA0512 gene product; ALEX2
H.sapiens polyA site DNA
RAB4, member RAS oncogene family
ESTs

SEC22, vesicle trafficking protein (S. c
mitochondrial ribosomal protein L16
hypothetical protein MGC10433
protein regulator of cytokinesis 1
TH1 drosophila homolog
CGM5 protein

hypothetical protein FLJ13287

PRO1914 protein

hypothetical protein PRO1855

DNA segment on chromosome X (unique) 992

KIAA0776 protein

collagen, type VIII, alpha 2

Homo sapiens cDNA FLJ12961 fis, clone NT

Sjogren syndrome antigen A2 (60kD, ribon

geranylgeranyl diphosphate synthase 1

hypothetical protein MGC4840

glutamyl-prolyl-tRNA synthetase

thymosin, beta, identified in neuroblast



KIAA0493 protein
GDP dissociation inhibitor 2
KIAA0310 gene product
mutL (E. coli) homolog 1 (colon cancer,
KIAA0475 gene product
solute carrier family 11 (proton-coupled
sex comb on midleg homolog 1
tousled-like kinase 2

CD44 antigen (homing function and Indian

eukaryotic translation initiation factor

Homo sapiens clone PP1596 unknown mRNA

chromosome 21 open reading frame 59

lectin, mannose-binding, 1

ESTs, Moderately similar to AF116721 89

KIAA1266 protein

Rho-associated, coiled-coil containing p
U2(RNU2) small nuclear RNA auxiliary fac
ring finger protein 15
hypothetical protein FLJ10808
Homo sapiens cDNA FLJ11095 fis, clone PL
Homo sapiens cDNA FLJ13598 fis, clone PL
Homo sapiens mRNA for KIAA1724 protein,
KIAA0666 protein

Homo sapiens mesenchymal stem cell prote
CGI-48 protein

Homo sapiens cDNA FLJ11392 fis, clone HE
ESTs

RNA binding motif protein 3
ESTs

clone HQ0310 PRO0310p1

Homo sapiens, clone IMAGE:3351295, mRNA

UDP-N-acetyl-alpha-D-galactosamine:polyp
PRO0149 protein

protein with polyglutamine repeat; calci

KIAA0483 protein

ESTs

RNA binding motif protein 8A

Homo sapiens, clone IMAGE:2961368, mRNA,

mitogen-activated protein kinase 1
133174 AA431620 Hs.324178

133175 AW955632 Hs.66666
133177 X97795 Hs.66718
133208 AI801777 Hs.6774

133226 AW954569 Hs.296287

133228 AI492924 Hs.5831

133254 AI567421 Hs.273330

133268 AW956781 Hs.293937

133291 BE297855 Hs.69855

133314 AA102670 Hs.70725

133321 T79526 Hs.179516

133327 AL390127 Hs.7104

133347 BE257758 Hs.71475

133360 AI016521 Hs.71816

133366 AA292811 Hs.72050

133367 AF231919 Hs.WSS

133370 AF245505 Hs.72157

133390 AI950382 Hs.72660

133391 AW103364 HsJ27
133394 AA305127 Hs.237225

133437 AL031591 HsJ370

133452 NM_002759 Hs^74382

133453 AI659306 Hs.73826
133500 AW964804 Hs.74280

133529 W45623 Hs.74571

133543 AU077073 Hs.108327

133578 AU077050 Hs.75066

133579 X75346 Hs.75074
133582 BE391579 Hs.75087

133594 AW160761 Hs.172589

133595 AA393273 Hs.75133

133599 NM_002885 Hs.75151

133621 NM_004893 Hs.75258

133627 NM_002047 Hs.75280

133631 NM_000401 HsJ5334

133649 U25849 Hs.75393

133690 AV661185 Hs.75574

133720 L27841 Hs.75737

133722 AW969976 Hs.279009

133751 AW402048.comp

133757 T52946 Hs.196209

133760 BE271766 Hs.181357

133765 M62194 Hs.75929

133780 AA557660 Hs.76152

133797 AL133921 Hs.76272

133822 D50525 Hs.699

133842 AW797468 Hs.285013

133845 AA147026 Hs.76704

133865 AB011155 Hs.170290

133867 AW340125 Hs.76989

133868 AB012193 Hs.183874

133922 U30825 Hs.77608

133924 D86326 Hs.325948

133929 NM_006306 Hs.211602

133936 L17128 Hs.77719

133941 BE244332 Hs.77770

133959 X81789 Hs.77897

133976 AI908165 Hs.169946

133989 AL040328 Hs782a2

133997 AI824113 Hs.78281

134010 AB016092 Hs.197114

134015 D31764 Hs.278569

134070 NM_003590 Hs.78946

134110 U41060 Hs.79136

134129 NM_014742 Hs.79305

134134 H86504 Hs.173328

134200 BE559598 Hs.197803



hypothetical protein MGC2745

ESTs, Weakly similar to S19560 prolines

RAD54 (S. cerevisiae)-like

ESTs

Homo sapiens, Similar to bromodomain-con
golgi phosphoprotein 1

Homo sapiens, clone IMAGE:3544662, mRNA,
ESTs, Weakly similar to FXD2_HUMAN FORKH
NRAS-related gene

gamma-aminobutyric acid (GABA) A recepto
integral type I protein
Kruppel-like factor 13
acid cluster protein 33
v-akt murine thymoma viral oncogene homo
non-metastatic cells 5, protein expresse
KIAA0539 gene product
DKFZP564I1922 protein
phosphatidylserine receptor
inhibin, beta A (activin A, activin AB a
hypothetical protein HT023
phosphotidylinositol transfer protein, b
protein kinase, interferon-inducible dou
protein tyrosine phosphatase, non-recept
hypothetical protein FLJ22237
ADP-ribosylation factor 1
damage-specific DNA binding protein 1 (1
translin

mitogen-activated protein kinase-activat
Fas-activated serine/threonine kinase
nuclear phosphoprotein similar to S. cer
transcription factor 6-like 1 (mitochond
RAP1, GTPase activating protein 1
H2A histone family, member Y
glycyl-tRNA synthetase
exostoses (multiple) 2
acid phosphatase 1, soluble
mitochondrial ribosomal protein L19
pericentriolar material 1
matrix Gla protein
Hs.334787

RAE1 (RNA export 1, S.pombe) homolog
laminin receptor 1 (67kD, ribosomal prot
cadherin 11, type 2, OB-cadherin (osteob
decorin

retinoblastoma-binding protein 2
peptidylprolyl isomerase B (cyclophilin
putative human HLA class II associated p
ESTs

discs, large (Drosophila) homolog 5
KIAA0097 gene product
cullin 4A

splicing factor, arginine/serine-rich 9
vesicle docking protein p115
SMC1 (structural maintenance of chromoso
gamma-glutamyl carboxylase
adaptor-related protein complex 3, mu 2
splicing factor 3a, subunit 3, 60kD
GATA-binding protein 3 (T-cell receptor
SWI/SNF related, matrix associated, acti
regulator of G-protein signalling 12
RNA binding protein; AT-rich element bin
sorting nexin 17
cullin 3

UV-1 protein, estrogen regulated
KIAA0255 gene product
protein phosphatase 2, regulatory subuni
KIAA0160 protein
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other 


9.3 


other 


4.5 


TM 


5.5 


TM 


27 


other 


17 


? 


1.3 


other 


12.2 


other 


1.2 


other 


17 


TM 


11.1 


? 


2.9 


other 


2.5 


? 


1.5 


other 


2.1 


other 


1.3 


other 


2.2 


other 


57 


TM 


25.5 


other 
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other 
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other 
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other 


1.5 


other 
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TM 


4 
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1.8 


? 


1.5 


other 


3.5 


TM 


6.8 


TM 


2.6 


TM 


1.4 


other 


8.1 


other 


13.5 


other 


2.2 


other 


1.8 


other 


2 


other 


2.8 


other 


6.8 


other 


2.5 


other 


Homo sapiens, S 


1.4 


? 


5.4 


other 


5 


other 


3.8 


other 


3.1 


? 


97 


? 


2.4 


other 


2.5 


other 


5 


other 


2.5 


? 


2.1 


other 


2.8 


TM 


1.8 


SS. 
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? 


2.6 


other 


2.9 


other 


10.4 


other 
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other 
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SS. 
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134206 AF107463 


Hs.79968 


splicing factor 30. survtva) of motor ne 


1.3 


other 


134219 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 19 other
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, 10.3 SS.
134275 AI878810 Hs.3688 cisplatin resistance-associated overexpr 2.5 other
134292 AI906291 Hs.81234 immunoglobulin superfamily, member 3 1.3 TM
134301 AW502505 Hs.81360 Homo sapiens cDNA: FLJ21927 fis, clone H 1.6 TM
134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2.1 TM
134324 AB029023 Hs.179946 KIAA1100 protein 5.3 ?
134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (vers 2.5 TM
134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 3.9 ?
134337 NM_004922 Hs.81964 SEC24 (S. cerevisiae) related gene famil 2.4 TM
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, 6.8 TM
134367 AA339449 Hs.82285 phosphoribosylglycinamide formyltransfer 2.3 TM
134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 5.5 other
134379 AW362124 Hs.323193 hypothetical protein MGC3222 5.9 TM
134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor different 2.2 other
134391 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like rep 2.1 other
134395 AA456539 Hs.8262 lysosomal 2.3 other
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 other
134411 BE272095 Hs.167791 reticulocalbin 1, EF-hand calcium bindin 4.4 other
134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, m 2.3 other
134421 AU077196 Hs.82985 collagen, type V, alpha 2 6.8 ?
134424 244190 Hs.83023 peroxisomal biogenesis factor 11B 2.4 other
134446 AA112036 Hs.83419 KIAA0252 protein 2.9 other
134447 M58603 Hs.83428 nuclear factor of kappa light polypeptid 6.7 other
134470 X54942 Hs.83758 CDC28 protein kinase 2 2.4 other
134480 NM_005000 Hs.83916 Empirically selected from AFFX single pr 6.3 ?
134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 1.9 other
134498 AW246273 Hs.84131 threonyl-tRNA synthetase U other
134513 AA425473 Hs.84429 KIAA0971 protein 1.4 other
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 1.4 other
134520 BE091005 Hs.74861 activated RNA polymerase II transcriptio 5.6 other
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8 ?
134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 1.7 other
134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 1.7 TM
134612 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37 21 other
134624 AF035119 Hs.8700 deleted in liver cancer 1 1.3 other
134632 X78520 Hs.174139 chloride channel 3 2.1 ?
134654 AK001741 Hs.8739 hypothetical protein FLJ10879 2.3 other
134666 BE391929 Hs.8752 transmembrane protein 4 4 other
134687 U62317 Hs.88251 arylsulfatase A 6.2 other
134692 NM_003474 Hs.8850 a disintegrin and metalloproteinase doma 2 other
134705 BE161887 Hs.88799 anaphase-promoting complex subunit 10 13 SS.
134714 Y14768 Hs.890 lysosomal 72 ?
134719 AA852985 Hs^2 chromobox homolog 5 (Drosophila HP1 alph 2.2 other
134722 AF129536 Hs^84226 F-box only protein 6 2.5 other
134746 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 5 other
134751 AW630803 Hs.89497 lamin B1 6.1 other
134790 BE002798 Hs587850 integral membrane protein 1 5.6 TM
134834 AW451370 Hs.8991 adaptor-related protein complex 1, gamma 5.3 other
134850 AI701162 Hs.90207 hypothetical protein MGC11138 9.1 other
134853 BE268326 Hs.90280 5-aminoimidazole-4-carboxamide ribonucle 2.4 other
134880 AI879195 Hs.90606 15kDa selenoprotein 2.7 other
134925 AW885909 Hs.6975 PRO1073 protein 1.5 other
134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), cat 4.9 other
134971 AI097346 Hs.286049 phosphoserine aminotransferase 2 other
134975 R50333 Hs.92186 Leman coiled-coil protein 2.6 TM
135011 AB037835 Hs.92991 KIAA1414 protein 1.4 ?
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mi 1.6 ?
135032 AW301984 Hs.173685 hypothetical protein FLJ12619 1.4 other
135077 AW503733 Hs.9414 KIAA1488 protein 1.8 other
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase s 2.5 other
135095 AF027219 Hs.9443 zinc finger protein 202 1.5 TM
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 2.1 other
135153 AI093155 Hs.95420 JM27 protein 4.4 ?
135181 BE250865 Hs.279529 px19-like protein 14.9 ?
135199 AA477514 Hs.96247 translin-associated factor X 1.3 other
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135207 N26427 


Hs.9634 


ESTs. Highly sImHar to C10_HUMAN PUTATI 
hypofivetfcal protern FLJ11656 


1.7 


other 


135214 T78802 


HS.96S60 


6.2 


other 


135243 BE463721 Hs.97101 putative G protein-coupled receptor 2.8 TM
135245 AI028767 Hs.262603 ESTs 12.2 TM
135257 AW291023 Hs.97255 ESTs, Weakly similar to A45010 X-linked 77 TM
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 1.8 other
135274 AA448460 Hs.112017 GE36 gene 4.2 SS,
135294 AA150320 Hs.9800 protein kinase Njmu-R1 1.2 other
135295 AI090838 Hs.98006 ESTs 4.9 other
135307 AI743770 Hs.98363 ESTs, Weakly similar to KIAA0822 protein 5.9 ?
135321 AI652089 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3 TM
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr 5.8 ?
135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HE 8.1 other
135389 U05237 Hs.99872 fetal Alzheimer antigen 1.9 other
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9 TM
302256 AA857131 Hs.171595 HIV TAT specific factor 1 1.6 other
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.3 other
303135 AW592789 Hs.279474 HSPC070 protein 2.2 TM
303686 AK000714 Hs.109441 MSTP033 protein 1.4 SS.
310085 R43191 Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq 5.2 other
315518 AA808229 Hs.167771 ESTs 2.3 ?
317781 NM_007057 Hs.42650 ZW10 interactor 2.9 ?
320836 AI268997 Hs.197289 rab3 GTPase-activating protein, non-cata 2 other
321114 AA902256 Hs.78979 Golgi apparatus protein 1 5.6 SS,
322221 N24236 Hs.179662 nucleosome assembly protein 1-like 1 1.4 ?
322474 AF118083 Hs^9494 PRO1912 protein 1.3 other
322556 BE041451 Hs.177507 hypothetical protein 2.9 SS,
323541 AF292100 Hs.104613 RP42 homolog 1.6 other
407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo 1.8 other
408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 1.6 other
408813 AiSBOOgO Hs.48295 RNA helicase family 6i other
409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti 5.7 other
409487 H19866 gb:yn57a05.r1 Soares adult brain N2b5HB5 2.7 other
413670 AB000115 Hs.75470 hypothetical protein, expressed in osteo 2.6 ?
414108 AI267592 Hs.75761 SFRS protein kinase 1 2.4 TM
414846 AW304454 Hs.77495 UBX domain-containing 1 2.4 other
416040 AW819158 Hs.289044 Homo sapiens cDNA FLJ12048 fis, clone 2.3 other
416980 AA381133 Hs.80684 high-mobility group (nonhistone chromoso 4.2 TM
417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate 23.6 other
418283 S79895 Hs.83942 cathepsin K (pycnodysostosis) 5.8 other
418467 NM_006910 Hs.85273 retinoblastoma-binding protein 6 1.3 other
420269 U72937 Hs.96264 alpha thalassemia/mental retardation syn 1.6 ?
420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.3 ?
421225 AA463798 Hs.102696 MCT-1 protein 1.6 ?
421642 AF172066 Hs.106346 retinoic acid repressible protein 3.5 other
421628 AW891965 Hs.279789 histone deacetylase 3 5 other
421983 AI252640 Hs.110364 peptidylprolyl isomerase C (cyclophilin 3.1 TM
422052 AA302744 Hs.104518 ESTs 1.9 TM
422055 NM_014320 Hs.111029 putative heme-binding protein 2.4 other
423750 AF165883 Hs.298229 prefoldin 2 4.2 ?
424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105 7.1 ?
425182 AF041259 Hs.155040 zinc finger protein 217 2.3 other
425284 AF155568 Hs.155489 NS1-associated protein 1 3.5 other
426372 BE304680 Hs.169531 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 1.9 ?
428049 AW183765 Hs.182238 GW128 protein 7.6 ?
428477 AW500533 Hs.11482 splicing factor, arginine/serine-rich 11 1.7 other
437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 2.4 other
438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcinoma-as 3.6 other
441560 F13386 Hs.7e88 Homo sapiens clone 23736 mRNA sequence 5.6 other
445580 AF167572 Hs.12912 skb1 (S. pombe) homolog 2 TM
446999 AA151520 Hs.334822 hypothetical protein MGC4485 7.6 other
447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal) 2.2 other
447778 BE620592 Hs.71190 ESTs, Weakly similar to S16506 hypotheti 2.9 other
448873 NM_003677 Hs.22393 density-regulated protein 1.8 other
449687 W68520 Hs.331328 intermediate filament protein syncoilin 5.9 other
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.7 other
450703 AA011202 Hs.184771 nuclear factor I/C (CCAAT-binding transc 1.4 other
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452461 N78223 Hs.108106 transcription factor 

452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 fis, clone AD
453157 AF077036 Hs.31989 DKFZP586G1722 protein
453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp
100685 AA328229 Hs.184582 ribosomal protein L24
100690 AA383256 Hs.1657 estrogen receptor 1
100833 AF135168 Hs.108802 N-ethylmaleimide-sensitive factor
100850 AA836472 Hs.297939 cathepsin B
101161 NM_006262 Hs.37044 peripherin
102481 U50360 gb:Human calcium, calmodulin-dependent p
102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma
103549 BE270465 Hs.78793 protein kinase C, zeta
103749 AL135301 Hs.8768 hypothetical protein FLJ10849
104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein
104532 AI498763 Hs.203013 hypothetical protein FLJ12748
104563 AL117403 Hs.306189 DKFZP434F1735 protein
105032 AA127818 gb:zi12a025l Soares_pregnant_uterus_NbH
105039 AA907305 Hs.36475 ESTs
106531 AA454036 Hs.8832 ESTs
106977 AL043152 Hs.50421 KIAA0203 gene product
107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL H
108717 AA122393 Hs.70811 hypothetical protein FLJ20516
110018 AW579842 Hs.104557 hypothetical protein FLJ10697
110330 AI288666 Hs.16621 DKFZP434I116 protein
111391 NM_003896 Hs.225939 sialyltransferase 9 (CMP-NeuAc:lactosylc
111392 W46342 Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA,
1135M AW503990 Hs.142442 HP1-BP74
113722 AV653556 Hs.184411 albumin
115008 AK001827 Hs.87889 helicase-moi
115062 AA253314 Hs.154103 LIM protein (similar to rat protein kina
115121 AI634549 Hs.88155 ESTs
117881 AF161470 Hs.260622 butyrate-induced transcript 1
119075 M10905 Hs.287820 fibronectin 1
119615 AL034423 Hs.75875 ubiquitin-conjugating enzyme E2 variant
120253 AA131376 Hs.326401 fibroblast growth factor 12B
125006 BE065136 Hs.145696 splicing factor (CC1.3)
127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture
128868 AA419008 Hs.106730 chromosome 22 open reading frame 3
128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, com
128959 AI580127 Hs.107381 hypothetical protein FLJ11200
129209 R62676 Hs.17820 Rho-associated, coiled-coil containing p
129449 AI096988 Hs.111554 ADP-ribosylation factor-like 7
129453 AW974265 Hs.111632 Lsm3 protein
129629 AK000398 Hs.11747 hypothetical protein FLJ20391
129917 M30773 Hs^78540 protein phosphatase 3 (formerly 2B), reg
129922 AF042379 Hs.13386 gamma-tubulin complex protein 2
129989 AB015856 Hs.247433 activating transcription factor 6
130182 BE267033 Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo
130365 W56119 Hs.155103 eukaryotic translation initiation factor
130471 AL121438 Hs.183706 adducin 1 (alpha)
130542 U64675 Hs.179825 RAN binding protein 2-like 1
130586 AB007891 Hs.16349 KIAA0431 protein
130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1
130992 BE398091 Hs.74316 desmoplakin (DPI, DPII)
131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-lin
131135 NM_016569 Hs.267182 TBX3-iso protein
131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin)
131760 X76732 Hs.3164 nucleobindin 2
131774 BE267158 Hs.169474 DKFZP586J0119 protein
131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU
131881 AW361018 Hs.3383 upstream regulatory element binding prot
131887 W17064 Hs.332848 SWI/SNF related, matrix associated, acti
132031 AF193844 Hs.3758 COP9 complex subunit 7a
132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37
132203 NM_004782 Hs.194714 synaptosomal-associated protein, 29kD
132240 AB018324 Hs.42676 KIAA0781 protein
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein
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other 
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other 
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TM 
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Z3 


other 


5.3 


TM 


4.6 


other 
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SS, 


4.6 


other 
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other 


2.7 


other 
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other 
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TM 
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other 


1.8 


TM 


17 
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TM 


Z6 


other 
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TM 
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132528 
132571 
132726 
132863 
133016 
133053 
133197 
133240 
133266 
133285 
133383 
133540 
133784 
133791 
133850 
133859 
133881 
134208 
134403 
134724 
134806 
134859 
135193 



T78736 

AW674699 

N52298 

BE268048 

AI439688 

A1085016 

AI275243 

AK001489 

A1160873 

M76477 

BE313555 

AL037159 

BE622743 

M34338 

m9092 

U86782 

U30872 

NM.000288 

AA334551 

AF045239 

AD001528 

D26488 

X95525 

AA243007 

T70541 

X57766 

S66431 

AA453483 



AA173417 

AA280588 

AA504223 

AA609996 

F02907 

AA480103 

AA024664 

AA251776 

AA399047 

N34059 

U95367 

AA490899 

T54762 

241963 

AA521186 

AA400195 

AA045083 



W85712 

W45728 

U61232 

AA425154 

T39176 

AA496000 

W38150 

T96595 

AA227463 

R46025 

AA233177 

AA3387G0 

AA412106 

L47276 

D82307 

AA293568 

R37778 

AA250843 

W49521 

D80000 



Hs^0758 SMC4 (structural maintenance of chromoso 7.4 

Hs.5169 suppressor of G2 allele of SKP1 , S. cere 6.9 

Hs.55608 hypothetical protein MGC955 14;3 

Hs.236494 RAB10, member RAS oncogene famrty 10.3 

Hs.6289 hypothetical protein R.J20886 4.4 

Hs.6390 Homo sapiens clone FLB3344PRO0B45mRNA. 1.8 

Hs.1 80201 hypothetical protein aJ20671 1 .8 

Hs.242894 ADP-ribosylation factor-llke 1 1 .8 

Hs.69233 zinc finger protein 16.1 

Hs^89082 6M2 ganglioside activator protein 10,4 

Hs.7252 K1AA1224 protein 1.5 

Hs.74619 proteasome (prosome, macropain) 26S subu 1 .7 

Hs.301064 arfaptin 1 12.1 

H3.76244 spermidine synthase 9.7 

Hs.7678 ceUular refinoic acid-binding protein 1 4.2 

Hs.176761 26S proteasome-assodated padi homolog 2.2 

Hs.77204 centromere protein F (35O/400kD. mitosin 9.1 

Hs.79993 peroxisomal biogenesis factor 7 3.2 

Hs.82767 sperm spedfic antigen 2 1.4 

Hs.321576 ring finger protein 22 1.4 

Hs.89718 spemiine synthase 2,6 

Hs.90315 K1AA0007 protein 133 

Hs.96103 TATA box binding protein (TBP)-assoclate 3.1 

ESTs 1.6 

ESTs 2.5 

lHumanstromelysin-3mRNA 4.5 

Homo sapiens done 23592 mRN A sequence 3. 1 

ESTs 4.6 

ESTs 1.4 

ESTs 1.9 

ESTs 2.2 

ESTs Highly simiar to CHROMOSOME 2.4 

ESTs HighV simBar to Surf-4 protebi |M.musculusl 5.5 

ESTs 2.3 

ESTs Weakly similar to ALU SUBFAMILY J 2.8 

Human NADH:ubfquinone oxidoreductase subunrt 6.2 

ESTs 2.3 

ESTs 2.4 

EST.RC_I^34059 3.3 
Human GABA-A receptor pi subunit mRNA complete cds 

• ESTs 3.3 

ESTs 29 
Homo sapiens HP protein (HP) mRNA complete cds 1.3 

ESTs 1.6 

ESTs 1.3 
VITAMIN K^EPENDENT GAMMA-CARBOXYLASE 
Homo sapiens mRNA for GDP dfssodation Inhibitor beta 
ESTs Weakly simflar to PROCOLLAGEN ALPHA 2(IV 

ESTs Highly similar to HETEROGENEOUS 3.7 
Human tubuHn-lbMing cofactor E mRNA complete cds 



ESTs 



5.3 
26 
1.9 
1,7 
1.8 



ESTs Weakly similar to ZK1058.4 [C.elegans] 
ESTs 

EST.RC«W38150 
EST.RC.T96595 
ESTs Weakly similar to No definitibn line found [Celegans] 

ESTs 2.8 

ESTs 2 

ESTs 1.3 

ESTs 62 

EST-L47276 3.4 
ESTs Weakly similar to TH1 protein p.melanogasterl 

ESTs 1.5 

ESTs 2.4 

Interferon regulatory factor 5 1 4.6 

Human prolyl 4-hydroxyiase alpha (II) subunit 6.5 

Human mRNA for KIAA0178 gene partial cds 2 
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other 
? 

other 
other 

SS.TM 
other 
other 
other 
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other 
other 
other 

SS, 
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other 
other 
other 
other 
? 

SS. 
other 
other 

TM 

other 

other 

other 

other 

? 

TM 

TM 

other 

other 

other 

other 

1.7 
other 
? 
? 

TM 

other 
2.5 
1.6 
2.6 

other 
2.1 

other 
SS.TM 
SS, 

? 

TM 
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SS, 
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? 

other 
other 

II. 4 
other 
other 
? 

? 

other 



TM 



other 

TM 

TM 

other 



other 
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R9997o 


ESTs Weakly simaar to line-1 protein 0RF2 [H.sapiens] 


0.1 


AA1 95030 


Human Ro/SSA rilxjrttJcteoprotein homolog (RoRet 5.3 


o 

f 




ESTs Weakly similar to PROBABI^ E5 


1.4 


other 


U37547 


Human lAP homolog B (MIHB) mRNA complete ods 3^ 


other 


AA4799o1 


ESTs 


1.7 


other 


X57579 


tnhibin beta A (aMn A actlvin AB alpha polypeptide) 


13.0 


AA449u71 


ESTs 


1.3 


TM 


N51B55 


ESTs Modetately similar to N AD{+) ADP- 


1.3 


otfier 


AA421213 


ESTs Weakly similar to F28Fd.3 [Celegans] 


22 


other 


AA355201 


ESTs 


1i 


ee THii 
oO|TM 


N78717 


H^piens mRNA Ibrtranslln 


1.5 


? 


nfOOUO 


ESTs 


5 




U86782 


Human 26S proteasome-assodated padi 


22 


Other 


AA234817 


ESTs 


1.3 


other 


D13666 


Homo sapiens mRNA for osteoblast spedftc 


7.5 


SS. 


AA236177 


ESTs 


7.1 


? 


U50648 


Protein kinase interferonnnducible double 


4.1 


? 


M28211 


Homo sapiens GTP-bindhg protein (RAB4) 


2.9 


other 


AA446949 


ESTs 


2.2 


other 


W03007 


ESTs 


1.2 


other 


W61011 


ESTs 


1.2 


other 


W87544 


ESTs 


1.2 


other 


X02751 


Neuroblastoma RAS viral (v-ras) oncogene homokDg 1.2 


? 


Z14077 


YY1 transcription factor 


\2 


other 


Z38839 


ESTs 


12 


? 


AA410894 


ESTs 


1.7 


other 


AA504499 


ESTs Highly similar to probable chloride channel 3 [Ksap 


1.3 






TABLE 7A 

Table 7A shows the accession numbers for those pkeys lacking unigeneID's for Table 7.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

102481 3128L-28 U50360
105032 genbank_AA127818 AA127818
409487 1134778J H19886 AW402806 T10231
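
The row layout of Table 7A (one Pkey, one CAT number, then one or more Genbank accessions) can be sketched as a small lookup structure. The following Python sketch is illustrative only: the names `ClusterRow`, `parse_table_7a_row` and `index_by_pkey`, and the assumption that fields are separated by whitespace, are introduced for this example and are not part of the disclosure.

```python
from typing import Dict, List, NamedTuple


class ClusterRow(NamedTuple):
    """One Pkey / CAT number / Accession row (hypothetical type)."""
    pkey: int               # unique Eos probeset identifier number
    cat_number: str         # gene cluster number
    accessions: List[str]   # Genbank accession numbers in the cluster


def parse_table_7a_row(line: str) -> ClusterRow:
    """Split one whitespace-separated row into its three fields.

    Assumption: the first token is the Pkey, the second is the CAT
    number, and every remaining token is a Genbank accession number.
    """
    tokens = line.split()
    if len(tokens) < 3:
        raise ValueError(f"expected Pkey, CAT number and accession(s): {line!r}")
    return ClusterRow(int(tokens[0]), tokens[1], tokens[2:])


def index_by_pkey(lines: List[str]) -> Dict[int, ClusterRow]:
    """Build a Pkey -> row lookup, skipping blank lines."""
    rows = (parse_table_7a_row(line) for line in lines if line.strip())
    return {row.pkey: row for row in rows}
```

For example, `index_by_pkey(["102481 3128L-28 U50360"])[102481].accessions` yields the single accession listed for that probeset; the CAT number is carried through as an opaque string.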
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue.
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor
tissue compared to other body tissues, and another column shows the ratio of expression of
the indicated gene in breast tumor tissue compared to normal breast tissue.
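
As a rough illustration of how the two ratio columns relate to underlying expression measurements, the sketch below computes both ratios from three expression values. The function name, the argument names, and the use of a single summary expression value per tissue group are assumptions made for this example; the table itself reports only the resulting ratios.

```python
from typing import Tuple


def expression_ratios(tumor: float,
                      normal_body: float,
                      normal_breast: float) -> Tuple[float, float]:
    """Return (R1, R2) for one gene.

    R1: tumor expression relative to normal body tissue.
    R2: tumor expression relative to normal breast tissue.
    Assumption: each argument is a single positive summary value
    (for instance an average intensity) for that tissue group.
    """
    if normal_body <= 0 or normal_breast <= 0:
        raise ValueError("reference expression values must be positive")
    r1 = tumor / normal_body
    r2 = tumor / normal_breast
    return r1, r2
```

Under these assumptions, a ratio above 1 in either column indicates higher expression in breast tumor tissue than in the corresponding reference tissue.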






Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of tumor to normal breast tissue

Pkey ExAccn UnigeneID Unigene Title R1 R2


100075 AF152333 Hs.284160 protocadherin gamma subfamily B, 4 1 3.8
100229 AV652249 Hs.180107 polymerase (DNA directed), beta 17 5.3
100262 D38500 Hs.278468 postmeiotic segregation increased 2-like 0.8 4.8
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgiz 3.2 2.3
100355 AI907114 Hs.71465 squalene epoxidase 3.3 1.4
100522 X51501 Hs.99949 prolactin-induced protein 11.9 0.4
100552 AA019521 Hs.301946 lysosomal 3.8 1.2
100599 X77343 Hs.334334 transcription factor AP-2 alpha (activat 9.4 9.4
100676 X02761 Hs.287820 fibronectin 1 3 7.8
100690 AA383256 Hs.1657 estrogen receptor 1 4.4 4.4
100895 U01351 Hs.75772 nuclear receptor subfamily 3, group C, m 1 3.9
101046 K01160 NM_002122:Homo sapiens major histocompati .7 4
101086 AA382524 Hs.250959 histatin 1 0.8 4.1
101148 NM_002923 Hs.78944 regulator of G-protein signalling 2, 24k 1.2 12
101161 NM_006262 Hs.37044 peripherin 3.1 1.1
101201 L22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uterin 4.4 0.6
101212 AI186220 Hs.83164 collagen, type XV, alpha 1 3.1 3.4
101441 AW468397 Hs.100000 S100 calcium-binding protein A8 (calgran 0.9 4.2
101447 M21305 gb:Human alpha satellite and satellite 3 29.9 0.3
101469 AA310162 Hs.169248 cytochrome c 0.8 4.9
101567 M33552 Hs.56729 lysosomal 1 6.9
101600 BE561617 Hs.119192 H2A histone family, member Z 2.8 4
101624 M55998 gb:Human alpha-1 collagen type 1 gene, 3 3.1 17
101674 NM_002291 Hs.82124 laminin, beta 1 1.5 4.1
101861 AA350659 Hs.83347 angio-associated, migratory cell protein 3.1 1.4
101977 AF112213 Hs.184062 putative Rab5-interacting protein 1.3 6.9
102193 AL036335 Hs.313 secreted phosphoprotein 1 (osteopontin, 1.9 4.9
102199 AA334592 Hs.79914 lumican 22 3.8
102304 AF015224 Hs.46452 mammaglobin 1 4.2 07
102345 NM_003480 Hs.300946 Microfibril-associated glycoprotein-2 1.1 4.2
102457 NM_001394 Hs.2359 dual specificity phosphatase 4 4.5 0.5
102534 U96759 Hs.198307 von Hippel-Lindau binding protein 1 1.4 4.2
102541 AI379954 Hs.79025 KIAA0096 protein 0.9 3.9
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 1.5 m
102962 R50032 Hs.159263 collagen, type VI, alpha 2 2.2 6.2
102991 AW293542 Hs.75309 eukaryotic translation elongation factor 5.6 5.7
103119 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 3.7 0.5
103175 X69089 Hs.79227 myomesin (M-protein) 2 (165kD) 1.3 4
103286 D38616 Hs.54941 phosphorylase kinase, alpha 2 (liver) 1.3 3.8
103319 X83492 Hs.82359 tumor necrosis factor receptor superfami 0.8 4.6
103372 BE536700 Hs.4888 seryl-tRNA synthetase 0.9 8
103419 T34708 Hs.272927 Sec23 (S. cerevisiae) homolog A 1.1 5.1
103471 Y00815 Hs.75216 protein tyrosine phosphatase, receptor t 37 1.2
103546 Z14244 Hs.75752 cytochrome c oxidase subunit VIIb 0.9 4.4
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103658 NM 000088 Hs.172928 collagen, type I. alpha 1 3^ 

103758 AA084e74 gb:zn13e04.r1 StratagenehNT neuron (937 0.9 

103774 H24185 Hs.92918 hypothetical protein 1.9 

103821 AA095971 H3.198793 Homo sapiens cDNA: FU22463 fis, done H 1.2 

103869 BE439604 Hs.24322 ATPase, H+ transporting, lysosomal {vacu 1.4 

103980 AW130242 Hs.293476 hypothetical protein FKSG44 1.6 

104054 AK001913 Hs.7100 hypothetical protein 1.5 

104115 AF183810 Hs^6102 opposite strand to trichortilnophalangeal 7 

104189 AB040927 Hs.301804 KIAA1494 protein 2 

104230 AB002347 Hs.15303 KIAA0349 pnotein 0.7 

104278 AW583693 Hs.109253 N-terminalacetyltransferase complex artJ 3.3 

104295 AW365522 Hs.103657 hypothetical protein PR0221 9 2.3 

104319 AW804296 Hs.9950 Sec61 gamma 3.1 

104425 AF283775 Hs.35380 x 001 protein 4 

104432 X51501 Hs.99949 prolactifHnduced protein 3.8 

104464 AW966726 Hs^642 methionine adenosyitransferase II. beta 0.8 

104479 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (f 

104505 W94824 Hs.11565 RIKEN cONA 2010100012 gene 2 

104592 AW63a488 Hs.325820 protease, serine. 23 1.9 

104613 AF1^303 Hs.24713 hypothetical protein 1.1 

104636 R82252 Hs.106106 protein Idnase (cAMP-dependent catatyti 1.2 

104782 AW270555 Hs.171774 hypothetical protein 1.4 

104792 AA960961 Hs.305953 zinc finger protein 83 (HPF1) 1 .5 

104848 AA305351 Hs. 274359 uncharaderized hypothalamus piDteln MAR 1.1 

104849 AI279065 Hs.241507 n^bosomal protein S6 1.3 

104850 AL1 33035 Hs.8728 hypothetical protein DKFZp434G171 U 
104852 W70164 Hs.20l07 ESTs 0.8 
104861 AA058630 Hs.29759 RNA POLYMERASE I AND TRANSCRIPT REI^SE 
104873 W03831 Hs^0597 host ceil factor homolog 0.8 
104891 W44626 Hs.30627 ESTs 0.7 
104920 AW955089 Hs.306083 Novel human gene mapping to chomosome 22 
104926 BE298608 Hs.33363 DKFZP434N093 protein 3.3 
104952 AW076098 H&74316 desmoplakin (DPI, OPII) 1.2 
104963 AB029020 Hs.173694 KIAA1097 protein 1.1 
104977 AI392640 Hs.18272 amino add transporter system A1 3.2 
105030 BE613061 Hs.337772 Homo sapiens, Similar to RIKEN cDNA 0610 1.6 
105035 N39760 Hs.8859 Homo sapiens. Similar to RIKEN cDNA 5830 1.5 
105068 BE410438 Hs.9005 VAMP (veside-assoclated membrane protei 1.1 
105159 AF146277 Hs.265561 CD2-associated protein 1.2 
105178 AA313825 Hs.21941 AD036 protein 3.6 
105162 BE407961 Hs.18271 golgiphosphoproteln3 1.7 
105274 AI554929 Hs.281 866 ATPase. H+ transporting, lysosomal (vacu 1.1 
105303 BE243327 Hs.182626 chromosome 22 open reading frame 5 1.5 
105413 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f
105426 W20027 Hs.23439 ESTs 4.3
105432 W03516 Hs.76698 stress-associated endoplasmic reticulum 1.5
105443 AA252372 Hs.12144 KIAA1033 protein 1.2
105483 AL137257 Hs.23458 Homo sapiens cDNA: FLJ23015 fis, clone L 1.7
105492 AI805717 Hs.289112 CGI-43 protein 2
105495 ALJ037715 Hs.28785 microfibrillar-associated protein 3 1.3
105539 AB040884 Hs.109694 KIAA1451 protein 2.7
105594 AB024334 Hs.25001 tyrosine 3-monooxygenase/tryptophan 5-mo 1.3
105623 BE504200 Hs.30127 hypothetical protein 1.7 
105807 AA788946 Hs.16869 ESTs, Moderately similar to CA1C RAT COL 3.9 
105812 BE614149 Hs.20814 CGI-27 protein 1.8 
105823 AI559444 Hs.293960 ESTs 15 
105831 AA329449 Hs.247302 twisted gastrulation 1.5 
105851 AI827976 Hs.24391 hypothetical protein FLJ13612 3.8
105879 BE392914 Hs.30503 Homo sapiens cDNA FLJ11344 fis, clone PL 1.7
105918 AW028485 Hs.26136 hypothetical protein MGC14156 1.7
105939 AL137728 Hs.12258 Homo sapiens mRNA; cDNA DKFZp434B0920 (f
105941 AB033075 Hs.10669 development and differentiation enhancin 1.3
105969 AB030656 Hs.17377 coronin, actin-binding protein, 1C 1.1
105990 Aie90586 Hs.29403 hypothetical protein FLJ22060 2
106012 AI240565 Hs.8895 ESTs 4.1
106060 NM_001329 Hs.171391 C-terminal binding protein 2 2.6
106070 T74445 Hs.5957 Homo sapiens clone 24416 mRNA sequence 1.4 
106083 H62087 Hs.31659 thyroid hormone receptor-associated prot 1^ 3.6

106155 AA425414 Hs.33287 nuclear factor l/B 5.4 12 

106255 BE613206 Hs.279607 calpastatin 1.8 4

106414 BE568205 Hs.28827 mitogen-activated protein kinase kinase 5.1 6.1

106538 AK000274 Hs.278635 HDCMA18P protein 1.2 5.9

106568 AW051564 Hs.28285 patched related protein translocated in 1.8 5.4

106574 BE044325 Hs527280 U6 snRNA-associated Sm-like protein 2.3 11.2

106613 N88604 Hs.30212 thyroid receptor interacting protein 15 1.2 3.6

106617 H09548 Hs.5367 ESTs, Weakly similar to 138022 hypotheti 0.9 4.4

106619 AA459480 Hs.23956 hypothetical protein FLJ20502 1.3 3.6

106701 BE387614 Hs.25797 splicing factor 3b, subunit 4, 49kD 1.6 7.3

106721 AA741038 Hs.6670 ESTs 17 6.1 

106776 AA206079 Hs.6693 hypothetical protein FLJ20420 1 5.4

106866 AA487416 Hs.268231 Homo sapiens cDNA: FLJ23111 fis, clone L 16 5.4

106868 BE185536 Hs.301183 molecule possessing ankyrin repeats indu 3.3 1.2

106887 BE503373 Hs.334335 hypothetical protein FLJ13576 14 6.3

106940 T85594 Hs.339808 hypothetical protein FLJ10120 3.3 1.8

106968 AF216751 Hs.26813 CDA14 3 3 

107052 BE391904 Hs.12482 glyceronephosphate O-acyltransferase 17 7.6

107061 BE147611 Hs.5354 stromal cell derived factor receptor 1 1.2 4.3

107149 AI289507 Hs.299883 hypothetical protein FLJ23399 18 6.5 

107222 BE172058 Hs.82689 tumor rejection antigen (gp96) 1 12 6.9

107233 BE267795 Hs.22595 hypothetical protein FLJ10637 14 3.5 

107295 AA186629 Hs.60120 UDP-N-acetyl-alpha-D-galactosamine:polyp Z6 4.3

107679 AA011510 Hs.60612 ESTs 18 4

107914 AA027229 Hs.61329 ESTs, Weakly similar to T16370 hypotheti 13 3.5

107966 AF109219 Hs.108787 phosphatidylinositol glycan, class N 1.6 3.5

108033 AW368993 Hs.323748 Homo sapiens clone CDABP0086 mRNA sequen 18 8.1

108060 AA291440 Hs.73149 paired box gene 8 11 3.5 

108081 AA093668 Hs.28578 muscleblind (Drosophila)-like 07 5.6

108137 AI283611 Hs.263479 ESTs, Weakly similar to HMGI_HUMAN HIGH 1.2 5.6

108186 AW068579 Hs.7780 Homo sapiens mRNA; cDNA DKFZp564A072 (fr 3.1 6.9

108215 AI879238 Hs.299315 collapsin response mediator protein-5; C 1.5 4.6

108297 AA333660 Hs.71331 hypothetical protein MGC5350 1^ 4

108339 AW151340 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S 6.3 4.7

108371 AA074374 Hs.67639 ESTs 13 3.8 

108399 AF086070 Hs.237519 EST 1 3.6 

108469 AA079487 gb:zm97f08.s1 Stratagene colon HT29 (937 U 3.6

108470 AA079500 gb:zm96h10.s1 Stratagene colon HT29 (937 1.1 4.3
108564 M23114 Hs.1526 ATPase, Ca++ transporting, cardiac muscl 2 4.9

108641 AA112059 Hs.429 ATP synthase, H+ transporting, mitochond 1.1 3.5

108668 AA058522 Hs.185751 ESTs 1.2 3.6 

108694 AA036725 Hs.61847 ESTs 14 3.6 

108824 AK001332 Hs.44672 hypothetical protein FLJ10470 1.4 3.5

108863 AA133456 Hs.102548 glucocorticoid receptor DNA binding fact 12 4

108893 BE276891 Hs.194691 retinoic acid induced 3 1.3 3.6

108992 AA152312 Hs.72047 ESTs 1.1 4.1

109072 AI732585 Hs.22394 hypothetical protein FLJ10893 12 3.5

109097 AA167512 gb:zp10f12.s1 Stratagene fetal retina 93 13 5

109160 BE220601 Hs.301997 hypothetical protein FLJ13033 4 6.1

109244 BE179030 Hs.64239 Human DNA sequence from clone RP5-1174N9 17 7.4 

109481 AA878923 Hs.289069 hypothetical protein FLJ21016 3.8 7.7 

109484 AA366263 Hs.72531 hypothetical protein FLJ11838 1.9 4

109795 AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (f 37 13

110009 BE075297 Hs.6514 ESTs, Weakly similar to A43932 mucin 2 p 4.6 7.4

110107 AW151660 Hs.31444 ESTs 1.2 3.5 

110411 AW001579 Hs.9645 Homo sapiens mRNA for KIAA1741 protein, 3.7 3.3

110731 NM_014899 Hs.188006 KIAA0878 protein 2.8 3.7

110756 N21207 Hs.182999 ESTs 16 3.5 

110930 BE242691 Hs.14947 ESTs 3.1 12

110935 AI753230 Hs.323562 hypothetical protein DKFZp564K142 1.9 7.5

111051 AI681293 Hs.12185 hypothetical protein FLJ22558 2 4

111110 AK001566 Hs.23618 hypothetical protein FLJ10704 11 3.8

111356 BE301871 Hs.4867 mannosyl (alpha-1,3-)-glycoprotein beta- 1 8.2

111357 BE314949 Hs.87128 hypothetical protein FLJ23309 3.3 6.1

111770 R27975 Hs.269401 ESTs, Moderately similar to S65657 alpha 1.2 5.4

111900 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence 3.2 0.8
111903 NM_014906 Hs.166351 KIAA1072 protein 1 5.4

111951 NM_014927 Hs.100527 KIAA0902 protein 1 3.8

112141 AW137198 Hs.278682 Phosphatidylglycerophosphate Synthase 14 3.5

112193 R49499 Hs.138238 ESTs 1.5 3.6

112197 NM_003655 Hs.5637 ESTs 4.6 2

112610 AW500106 Hs.23643 serine/threonine protein kinase MASK 3.3 10.5

112971 Z42387 Hs.83883 transmembrane, prostate androgen induced 3.2 3

112984 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.7 10.8

113056 AF019226 Hs.8036 glioblastoma overexpressed 4^ 3.7

113449 AW160683 Hs.158005 hypothetical protein 1.2 4.4

113497 AF143321 Hs.15572 hypothetical protein IMAGE 109914 0.9 3.6

113508 AL042936 Hs.211571 holocytochrome c synthase (cytochrome c 1.1 3.5

113531 AK001898 Hs.16740 hypothetical protein FLJ11036 1.2 3.9

113604 AI075407 Hs.296083 ESTs, Moderately similar to 154374 gene 1.7 5.3

113674 NM_014214 Hs.5753 inositol(myo)-1(or 4)-monophosphatase 2 0.8 6.1

113841 W30681 Hs.146233 Homo sapiens cDNA: FLJ22130 fis, clone H 1.7 6.2

113857 AW243158 Hs.5297 DKFZP564A2416 protein 1.2 4.6

113931 BE255499 Hs.3496 hypothetical protein MGC15749 1.5 4

113936 W17056 Hs.83623 nuclear receptor subfamily 1, group I, m 3.8 1 

113987 AA345519 Hs.9641 complement component 1, q subcomponent 1.2 4.7

114132 AI342493 Hs.24192 Homo sapiens cDNA FLJ20767 fis, clone CO 0.3 4.3

114156 BE179882 Hs.336920 glutathione peroxidase 3 (plasma) 1.1 4.3 

114213 N58309 Hs.19575 CGM1 protein 1.6 9.2 

114636 AA075488 gb:zm88d01.s1 Stratagene ovarian cancer 1.6 3.7

114760 AI929382 Hs.252692 hypothetical protein FLJ20343 1.4 4

114781 T10446 Hs.95388 ESTs 1 4.3 

114795 AB037858 Hs.173484 hypothetical protein FLJ10337 1.6 9.2

114901 AV660012 Hs.196437 hypothetical protein FLJ10788 1.4 5.2

115096 AI683069 Hs.175319 ESTs 3.7 1

115518 BE541042 Hs.23240 Homo sapiens cDNA: FLJ21848 fis, clone H 3.2 4.2

115646 N36110 Hs.305971 solute carrier family 2 (facilitated glu 1.5 3.9

115764 AW582256 Hs.91011 anterior gradient 2 (Xenopus laevis) hom 1.3 5.9

115802 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.7 5.6

115994 AB037836 Hs.109315 KIAA1415 protein 1.5 9.1

116032 BE383668 Hs.42484 hypothetical protein FLJ10618 0.9 4.3

116046 BE395293 Hs.94491 hypothetical protein FLJ20297 1.6 5.5

116274 AI129767 Hs.182874 guanine nucleotide binding protein (G pr 3.2 2.4

116310 Z24B54 Hs.42299 ESTs 0.8 4.7 

116356 AI371223 Hs.288571 Homo sapiens cDNA FLJ11997 fis, clone HE 2.4 3.9

116429 AF191018 Hs.279923 putative nucleotide binding protein, est 5.5 5.5

116461 AA313607 Hs.58633 Homo sapiens cDNA: FLJ22145 fis, clone H 5 1.3

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 8.7 4.5

116578 D21262 Hs.75337 nucleolar and coiled-body phosphoprotein 3.2 6.9

116579 AW886411 Hs.81915 leukemia-associated phosphoprotein p18 ( 3.2 3
116589 AI557212 Hs.17132 ESTs, Moderately similar to 154374 gene 3.1 8.3

116786 H25836 Hs.301527 ESTs, Moderately similar to unknown [H.S 3.2 4.5 

117170 N25929 Hs.42500 ADP-ribosylation factor-like 5 7 5.5 

117215 N20066 Hs.133207 PTPRF interacting protein, binding prote 1.2 6.2

117280 M18217 Hs.172129 Homo sapiens cDNA: FLJ21409 fis, clone C 4.5 2.4

117576 AI383467 Hs.44597 ESTs 1.4 4.2

117667 U59305 Hs.44708 Ser-Thr protein kinase related to the my 4.3 0.5

117881 AF161470 Hs.260622 butyrate-induced transcript 1 2.1 5.7

118336 BE327311 Hs.47166 HT021 3.6 7.7

118475 N66845 gb:za45c11.s1 Soares fetal liver spleen 4.2 0.5

118493 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 3.5 3.3

118505 N67343 gb:yz50b07.s1 Morton Fetal Cochlea Homo 2.1 3.8 

119159 AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain 3.7 1.5 

119307 BE048061 Hs.37054 ephrin-A3 3 1.1

119355 BE218319 Hs.5807 GTPase Rab14 1.1 5.6

119462 BE041667 Hs.314544 Homo sapiens cervical cancer suppressor- 1.4 4.3

119771 AI905687 Hs.2533 EST 3.2 1

119940 AL050097 Hs.272531 DKFZP586B0319 protein 4.3 0.7

119943 BE565849 Hs.14158 copine III 3.5 1.9

120407 AA235207 Hs.250455 hypothetical protein DKFZp762F2011 1.5 3.7

120493 AW968080 Hs.152939 Homo sapiens clone 24630 mRNA sequence 4 1.4

120677 AF150208 Hs.108327 damage-specific DNA binding protein 1 (1 1.6 6.8

120867 AA350781 Hs.96967 ESTs 1.1 3.6 

121368 BE262956 Hs.178292 KIAA0180 protein 13 4.1

121603 AA416785 Hs.249495 heterogeneous nuclear ribonucleoprotein 22 5.5

121723 AA243499 Hs.104800 hypothetical protein FLJ10134 3.4 3.2

122223 AF169797 Hs.27413 adaptor protein containing pH domain, PT 3.9 3.9

122378 AB032948 Hs.21356 hypothetical protein DKFZp762K2015 1.4 7.1

122946 AI718702 Hs.308026 major histocompatibility complex, class 1.4 3.7

123155 AF121856 Hs.284291 sorting nexin 6 1.2 4.9 

123158 AF161426 Hs.218329 hypothetical protein 2.4 3.6 

123327 AA421581 Hs.178443 ESTs 0.9 5.2 

123495 W28673 Hs.106747 serine carboxypeptidase 1 precursor prot 1.3 5.1

123526 AA608657 gb:ae55d04.s1 Stratagene lung carcinoma 2.1 5.2

123533 AA608751 gb:ae56h07.s1 Stratagene lung carcinoma 2.1 9.3

123768 AI932318 Hs.188762 ESTs, Moderately similar to M2Bl_HUMANH 1.1 3.6

123961 AL050184 Hs.21610 DKFZP434B203 protein 1.1 3.5

123999 AF084555 Hs.7351 cyclic AMP phosphoprotein, 19kD 1.4 3.8

124000 BE563957 Hs.74861 activated RNA polymerase II transcriptio 19 11.2 

124038 AB037860 Hs.173933 nuclear factor I/A 1.5 4.4

124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 14.8 115

124083 AW195237 Hs.7734 hypothetical protein FLJ22174 1.2 6.2

124148 BE300094 Hs.227751 lectin, galactoside-binding, soluble, 1 2.5 12.7

124153 AU077333 Hs.160483 erythrocyte membrane protein band 7.2 (s 1 4.1 

124252 BE613340 Hs.334725 Homo sapiens, Similar to RIKEN cDNA 9430 1.5 8.4

124314 AK001552 Hs.215766 GTP-binding protein 1.8 lOi

124375 D87454 Hs.192966 KIAA0265 protein 1.1 4.8 

124432 N39016 Hs.268869 ESTs, Weakly similar to ALUC_HUMAN !!!! 1.3 4.1

124447 N48000 gb:yy98e12.s1 Soares_multiple_sclerosis_ 2.7 4.3

124539 D54120 Hs.146409 cell division cycle 42 (GTP-binding prot 2.1 5.7

124543 AI393320 Hs.104573 ESTs 1 4.1 

124564 H66409 Hs.108275 ESTs 1.4 4 

124574 AL036596 Hs.42322 A kinase (PRKA) anchor protein 2 0.7 4

124605 AA749315 Hs.77171 minichromosome maintenance deficient (S. 1.1 3.5

124639 H60193 Hs.21143 DKFZP586C1324 protein 1.4 3.6

124659 AI680737 Hs.289068 Homo sapiens cDNA FLJ11918 fis, clone HE 1.5 9.9

124737 BE270465 Hs.78793 protein kinase C, zeta 0.7 4

124760 AW408586 Hs.91052 ESTs, Moderately similar to ALU5_HUMAN A 0.9 3.6

124763 BE410405 Hs.76288 calpain 2, (m/II) large subunit 1.3 3.9

124792 R44357 Hs.48712 hypothetical protein FLJ20736 1.8 4.2

124842 R56485 gb:yg93h09.s1 Soares infant brain 1NIB H 1 3.6

124940 AF068846 Hs.103804 heterogeneous nuclear ribonucleoprotein 3.2 3.4

124949 AI903210 Hs.336780 tubulin, beta polypeptide 1 4.4

124960 AL023513 Hs.194766 seizure related gene 6 (mouse)-like 0.9 5.2

124995 T52700 Hs.110044 ESTs 0.9 3.5

125030 AA610577 Hs.187775 ESTs 1.2 5

125034 BE548446 Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 1.5 3.7

125058 T83731 Hs.3343 phosphoglycerate dehydrogenase 0.9 6

125076 AA973971 gb:oq02h08.s1 NCI_CGAP_Lu5 Homo sapiens 1 3.7

125090 T91518 gb:ye20f05.s1 Stratagene lung (937210) H 3.2 2.5 

125103 AA570056 Hs.122730 ESTs, Moderately similar to KIAA1215 pro 5.3 6.6

125154 W38419 gb:zc78a07.s1 Pancreatic Islet Homo sapi 0.9 6.1

125155 AA837043 Hs.143669 ESTs 1.1 4.3

125159 AK000669 Hs.274428 TRF2-interacting telomeric RAP1 protein 1.1 4.1

125170 ALa20996 Hs.8518 selenoprotein N 1.1 3.8

125181 R40815 Hs.12396 ESTs, Weakly similar to 2004399A chromos 1 3.6

125193 W67577 Hs.84298 CD74 antigen (invariant polypeptide of m 1.2 7.8 

125260 H05635 Hs.294030 topoisomerase-related function protein 4 1 4.9

125262 AW884980 Hs.171957 triple functional domain (PTPRF interact 1.3 4.8 

125272 BE612888 Hs.180224 myosin regulatory light chain 1.1 16.1 

125388 W27235 Hs.64311 a disintegrin and metalloproteinase doma 1.4 5.3 

125824 Z45258 Hs.286013 short coiled-coil protein 2.4 8.7

125852 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 1.8 4.6

125970 AW504721 Hs.177515 high density lipoprotein binding protein 1.9 3.8

126192 AW160399 Hs.30376 hypothetical protein 1.4 4.1

126469 BE384361 Hs.182885 ESTs, Weakly similar to JC5024 UDP-galac 2 3.7

126510 AA057593 Hs.334762 hypothetical protein FLJ14735 1.3 4.1

127095 AA340277 Hs.10248 Homo sapiens cDNA FLJ20167 fis, clone CO 1.3 5

127524 AI243596 Hs.94830 ESTs, Moderately similar to T03094 A-kin 4.3 0.9

128312 J04182 Hs.150101 lysosomal 1.5 4.7 
128453 X02761 Hs.287820 fibronectin 1 1.2 4.3
128460 T16206 Hs.237154 ESTs, Highly similar to LDHH_HUMAN L-LAC 3.1 44.4
128491 H08379 Hs.165563 hypothetical protein DKFZp434N1429 0.6 13.1
128495 NM_005904 Hs.100602 MAD (mothers against decapentaplegic, Dr 1.3 4
128546 NM_003478 Hs.101299 cullin 5 1 5.1
128574 AI185977 Hs.38260 ubiquitin specific protease 18 0.8 4
128611 NM_014721 Hs.102471 KIAA0680 gene product 1.3 3.7
128552 AA432202 Hs.103147 hypothetical protein FLJ21347 1.4 3.9
128653 D87432 Hs.10315 solute carrier family 7 (cationic amino 1.2 3.6
128655 AI246669 Hs.324275 WW domain-containing protein 1 0.8 4.1
128584 BE246444 Hs.283685 hypothetical protein FLJ20396 3 1.6
128717 AK001564 Hs.104222 hypothetical protein FLJ10702 2.8 4^
128774 AA476220 Hs.54457 CD81 antigen (target of antiproliferativ t1 10.6
128790 AF026692 Hs.105700 secreted frizzled-related protein 4 1 3.8
128805 AA194554 Hs.1d3434 ATPase, H+ transporting, lysosomal (vacu 5.3 5.3
128827 AI638184 Hs.106334 Homo sapiens clone 23836 mRNA sequence 2.2 5.3
128840 AI917602 Hs.106440 ESTs 1 4.5
128869 AA758242 Hs.80618 hypothetical protein 0.8 3.6
128889 D6D985 Hs.106909 DKFZP566D193 protein 4.6 3.7
128890 AI222020 Hs.182364 CocoaCrisp 3 1.5
128915 AK000140 Hs.107139 hypothetical protein 0.2 3.9
128920 AA622037 Hs.166468 programmed cell death 5 2.5 15.2
128926 AF155096 Hs.107213 hypothetical protein FLJ20585 4 4
128930 AA298958 Hs.10724 MDS023 protein 1.2 4.5
128942 AW247536 Hs.10729 hypothetical protein 1.4 5
128948 AW953622 Hs.223025 RAB31, member RAS oncogene family Z3 5.6
128953 AB020716 Hs.107362 KIAA0909 protein 0.9 3.9
128979 AW271217 Hs.281434 Homo sapiens cDNA FLJ14028 fis, clone HE 1.5 3.6
128980 AA258924 Hs.10758 NM_002495*:Homo sapiens NADH dehydrogena 0.8
129005 AI770025 Hs.13323 hypothetical protein FLJ22059 1.2 5.7
129009 C15105 Hs.330716 Homo sapiens cDNA FLJ14368 fis, clone HE 2.1 9.9
129013 AA371155 Hs.107942 DKFZP564M112 protein 2.4 3.8
129068 AI634522 Hs.152925 KIAA1268 protein 1.2 3.8
129106 AW504486 Hs.108689 sterol regulatory element binding transc 1.2 5.5
129113 BE543205 Hs.288771 DKFZP586A0522 protein 0.5 3.7
129125 AB002450 Hs.278391 CGI-109 protein 1 5.2
129126 AW881089 Hs.108806 Homo sapiens mRNA; cDNA DKFZp566li^0947 (f 1.5
129151 N23018 Hs.171391 C-terminal binding protein 2 2.1 9.7
129230 AA335362 Hs.109646 Empirically selected from AFFX single pr 0.9 8.6
129234 M18916 Hs.282997 glucosidase, beta; acid (includes glucos 1.1 3.5
129238 BE542214 Hs.109697 ESTs 1.1 1Z8
129239 W57656 Hs.109701 ubiquitin-like 5 3.2 5.1
129241 AI878857 Hs.109706 hematological and neurological expressed 1.9 5.7
129243 BE169531 Hs.109727 TAK1-binding protein 2; KIAA0733 protein 1 2 6.6
129247 R49920 Hs.109733 CGI-131 protein 1.5 3.5
129250 AA344367 Hs.109760 Empirically selected from multiple AFFX 1 5.4
129258 AA260970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 1.3 4.1
129260 AF077200 Hs.279813 hypothetical protein 1.6 3.9
129270 AA357185 Hs.109918 ras homolog gene family, member H 1.8 4.2
129277 AB007895 Hs.110 putative L-type neutral amino acid trans 1.1 6.1
129284 AA318224 Hs.296141 ESTs Z5 4.8
129300 W94197 Hs.110165 ribosomal protein L26 homolog 1.6 5.1
129318 AF189062 Hs.285976 tumor metastasis-suppressor 1.8 6.5
129352 AW511656 Hs.170177 Meis1 (mouse) homolog 0.9 4
129362 U30245 Hs.110736 solute carrier family 12 (sodium/potassi 1.4 9.2
129379 BE278964 Hs.11085 CGI-111 protein 1 4.8
129390 AA318271 Hs.250905 hypothetical protein 1 4.1
129416 AA016188 Hs.111244 hypothetical protein 1.8 10.7
129427 AI498631 Hs.111334 ferritin, light polypeptide 1.1 4.8
129470 W92931 Hs.250899 heat shock factor binding protein 1 1.8 9.3
129472 AL050260 Hs.323817 DKFZP547E1010 protein 1 5
129475 NM_004477 Hs.203772 FSHD region gene 1 1.1 4.2
129498 AA449789 Hs.75511 connective tissue growth factor 1.9 6.8
129501 AI631811 Hs.180403 STRIN protein 1.1 97
129527 AA769221 Hs.270847 delta-tubulin 1.1 4.3
129545 R18087 Hs.323769 cisplatin resistance related protein CRR 1 4.2
129579 AW517695 Hs.286218 junctional adhesion molecule 1 2.3 3.5
129606 AW968941 Hs.166254 hypothetical protein DKFZp566t1 33 2.4 4.4

129619 AA209534 Hs.284243 tetraspan NET-6 protein Z2 13 

129620 D79338 Hs.239720 CCR4-NOT transcription complex, subunit 1.6 4.6

129621 AL110212 Hs.301005 purine-rich element binding protein B 1.1 5.7
129634 AB020335 Hs.181300 sel-1 (suppressor of lin-12, C.elegans)- 0.9 4.3

129663 AI207406 Hs.11866 translocase of inner mitochondrial membr 1.9 4.8

129679 AW889132 Hs.11916 nMinase 0.9 4.1

129688 U53209 Hs.24937 transformer-2 alpha (htra-2 alpha) 1.3 4.7

129691 M26939 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 4.7 3.7

129712 U46386 Hs.12102 sorting nexin 3 1.2 3.6

129747 AL050272 Hs.12305 DKFZP566B183 protein 1 8.9

129788 BE397454 Hs.124969 Homo sapiens clone 24707 mRNA sequence 1.4 3.6 

129796 BE218319 Hs.5807 GTPase Rab14 2.9 5.1

129797 M62839 Hs.1252 apolipoprotein H (beta-2-glycoprotein I) 03 5.1
129800 AF052112 Hs.12540 lysosomal 1.6 8£

129834 AL080084 Hs.296155 CGI-100 protein 0.9 5.3

129836 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.8 9.9

129843 NM_014840 Hs.200598 KIAA0537 gene product 0.9 3.6

129874 AA626937 Hs.181551 hypothetical protein MGC2594 1.4 9.5

129878 Z43161 Hs.283714 30 kDa protein 1.1 6.3

129904 AL119499 Hs.13285 neuronal potassium channel alpha subunit 1 3.5

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 2 5.1

129976 X14008 Hs.234734 lysosomal 0.9 4.9

129982 Z14221 gb:H.sapiens germline transcript of Ig h 1.2 3.6

130007 R15917 Hs.142570 Homo sapiens clone 24629 mRNA sequence 4.3 1.3

130060 BE277024 Hs.145381 RNA binding motif protein, X chromosome 1.6 3.8

130064 X57815.comp Empirically selected from AFFX single pr 1.2 8.2 

130068 M93143 Hs.262869 plasminogen-like 1.4 75

130090 H97878 Hs.132390 zinc finger protein 36 (KOX 18) 1.4 12.3

130095 AK001635 Hs.14838 hypothetical protein FLJ10773 0.2 4.6

130102 W61005 Hs.14896 DHHC1 protein 1 4.1

130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( 1.2 5.3

130115 T47294 Hs.149923 X-box binding protein 1 3.8 0.8

130123 NM_005095 Hs.150390 zinc finger protein 262 1 4.2 

130150 BE094848 Hs.15113 homogentisate 1,2-dioxygenase (homogenti 0.5 4

130161 R4267B Hs.151385 KIAA0564 protein 1 3.7

130210 M23115 Hs.1526 ATPase, Ca++ transporting, cardiac muscl 0.4 4.4

130213 BE278370 Hs.15265 heterogeneous nuclear ribonucleoprotein 1.7 7.5

130215 BE301883 Hs.152707 glioblastoma amplified sequence 1 5.6 

130232 U29463 gb:Human cytochrome b561 gen 1.2 4.2

130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transpose 1.3 3.6

130281 W78907 Hs.15395 similar to arginyl-tRNA synthetase (argi 1.5 4.4

130343 AB040914 Hs.278628 KIAA1481 protein 2.9 7.5

130385 AW067800 Hs.155223 stanniocalcin 2 3.2 0.2

130414 AW842182 Hs.241392 small inducible cytokine A5 (RANTES) 1.4 10£

130417 AW163518 Hs.155485 huntingtin interacting protein 2 17 117

130440 AA852868 Hs.132853 KIAA0171 gene product 1.1 5

130442 NM_006245 Hs.118244 protein phosphatase 2, regulatory subuni 1.4 4.3

130465 AW362955 Hs.15641 Homo sapiens cDNA FLJ14415 fis, clone HE 1.6 7.6

130479 R44163 Hs.12457 hypothetical protein FLJ10814 0.9 4.1

130499 AB007915 Hs.158286 KIAA0446 gene product 1 3.8 

130546 AI598022 Hs.193989 TAR DNA binding protein 1.3 47 

130568 AA232119 Hs.16085 putative G-protein coupled receptor 1.2 9.4

130506 AI652143 Hs.288382 hypothetical protein FLJ13111 1 4.1

130612 BE242873 Hs.16677 WD repeat domain 15 1.1 3.6

130616 AL049963 Hs.284205 up-regulated by BCG-CWS 0.6 3.8

130623 AL045128 Hs.1691 glucan (1,4-alpha-), branching enzyme 1 0.9 6.6

130629 AL042896 Hs.1697 ATPase, H+ transporting, lysosomal (vacu 0.9 3.9

130632 AW073971 Hs.238954 ESTs, Weakly similar to KIAA1204 protein 0.9 6.9

130639 AI557212 Hs.17132 ESTs, Moderately similar to 154374 gene 2.6 35

130641 AF158556 Hs.239189 glutaminase 1.2 13.8 

130653 AI861791 Hs.278479 TSPY-like 1.3 4 

130655 AI831962 Hs.17409 cysteine-rich protein 1 (intestinal) 2.5 4

130666 AL117508 Hs.194035 KIAA0737 gene product 1.3 6.2

130669 AI928985 Hs.17680 hypothetical protein MGC1314 similar to 1.4 3.9

130693 R68537 Hs.17962 ESTs 3.2 0.8

130694 NM_014827 Hs.17969 KIAA0663 gene product 1.1 4.8 
130696 AA325308 Hs.18016 Homo sapiens mRNA; cDNA DKFZp586H0324 (f U 5

130701 Z98883 Hs.18079 phosphatidylinositol glycan, class Q 11 6.7

130707 AW190925 Hs.203559 hypothetical protein FLJ12701 1.2 4.1

130731 AI932971 Hs.18593 Homo sapiens cDNA: FLJ21449 fis, clone C 1.4 6.9

130787 AF072813 Hs.252831 reticulon 3 1.2 11.2

130796 AA088809 Hs.19525 hypothetical protein FLJ22794 1.8 6.8

130808 NM_001761 Hs.1973 cyclin F 1.3 4.1

130863 Y10805 Hs.20521 HMT1 (hnRNP methyltransferase, S. cerevi 3.2 5.9

130902 AB037750 Hs.21061 KIAA1329 protein 1 3.8

130908 AW195747 Hs.21122 hypothetical protein FLJ11830 similar to 1.3 7.9

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, membe 2.7 3.7

130913 BE390905 Hs.21198 translocase of outer mitochondrial membr 1.9 4

130923 H96115 Hs.21293 UDP-N-acteylglucosamine pyrophosphorylas 1.9 10.3

130959 AB023182 Hs.184523 KIAA0965 protein 1.5 6.8

130967 AA393071 Hs.182579 leucine aminopeptidase 1.4 5.5

130975 AA099923 Hs.283728 PEST-containing nuclear protein 1.3 3.8

131037 BE243101 Hs.22391 chromosome 20 open reading frame 3 1.9 4.1

131039 D87436 Hs.166318 lipin 2 1.6 3.5

131060 AA194422 Hs.22564 myosin VI 4.5 5 

131097 AL137682 Hs.22937 I-kappa-B-interacting Ras-like protein 2 2 3.7

131101 BE387561 Hs.22981 DKFZP586M1523 protein 1.6 4.5

131104 W27770 Hs.301756 ESTs, Weakly similar to T31475 hypotheti 0.9 3.5

131107 BE620886 Hs.75354 GCN1 (general control of amino-acid synt 2.1 4.5

131109 BE564123 Hs.23060 DKFZP564F0522 protein 1.1 4.6

131136 AB033099 Hs.23413 KIAA1273 protein 1.2 4.2

131148 AW953575 Hs.303125 p53-induced protein PIGPC1 4.5 135

131150 X77753 Hs.23582 tumor-associated calcium signal transduc 3.4 0.4

131156 AI472209 Hs.323117 ESTs 0.8 4.9

131164 AW013807 Hs.182265 keratin 19 3.3 Z4

131181 H25094 Hs.293663 ESTs, Moderately similar to 138022 hypot 0.6 4

131194 AW864222 Hs.24083 KIAA0997 protein 1.4 ZA

131199 AW979155 Hs.298275 amino acid transporter 2 1.2 8.5 

131215 AL050107 Hs.24341 transcriptional co-activator with PDZ-bi 0.7 4.7

131216 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 2.1 8.2
131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 1.7 3.5

131237 AW956858 Hs.24508 DKFZP554D177 protein 1.3 5.4

131262 AU077158 Hs.24930 tubulin-specific chaperone a 1.5 4.8

131263 AU077002 Hs.24950 regulator of G-protein signalling 5 1.4 4.4
131367 AI750575 Hs.173933 nuclear factor I/A 3.3 2.2

131372 AW293399 Hs.144904 nuclear receptor co-repressor 1 1.6 3.9

131373 NM_006052 Hs.26146 Down syndrome critical region gene 3 1 11.1 

131388 NM_014610 Hs.92200 KIAA0480 gene product 5 2

131492 AI452601 Hs.288869 nuclear receptor subfamily 2, group F, m 0.9 3.5

131493 AW960146 Hs.284137 hypothetical protein FLJ12886 1 3.5
131514 BE270734 Hs.2795 lactate dehydrogenase A 2 6.5

131524 AB040927 Hs.301804 KIAA1494 protein 1.5 10.7

131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 1.3 4.7 

131534 AF157326 Hs.184786 TBP-interacting protein 1.3 4.9 

131555 T47364 Hs.278513 interferon, alpha-inducible protein 27 1.5 8

131578 AA936296 Hs.234265 DKFZP586G011 protein 1.8 3.5

131589 C18825 Hs.29191 epithelial membrane protein 2 1.3 8.2

131609 D83032 Hs.169984 nuclear protein 2.8 3.9

131626 BE514605 Hs.289092 Homo sapiens cDNA: FLJ22380 fis, clone H 1.3 11.2

131670 H03514 Hs.10130 ESTs 1.3 4.8

131697 C19034 Hs.288613 Homo sapiens cDNA FLJ14175 fis, clone NT 3.2 9.7

131701 AF103798 Hs.30819 hypothetical protein 1.3 5.2 

131703 AW160865 Hs.30888 cytochrome c oxidase subunit VIIa polype 1.3 7.8

131739 AF017986 Hs.31386 secreted frizzled-related protein 2 10.6 14.7

131764 AI805664 Hs.31731 peroxiredoxin 5 1.1 3.6

131781 AF077036 Hs.31989 DKFZP586G1722 protein 1.6 3.7

131791 X62111 gb:H.sapiens VII-5 gene for immunoglobul 1.1 3.5

131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 5.3 1.2

131870 NM_014874 Hs.3363 KIAA0214 gene product 0.6 A2

131903 NM_004642 Hs.3436 deleted in oral cancer (mouse, homolog) 2.4 4.9

131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso 2.4 6

131930 AA772603 Hs.69476 Homo sapiens cDNA FLJ12758 fis, clone NT 1.7 9.2

131941 BE252983 Hs.35086 ubiquitin specific protease 1 0.5 5.2
131947 AI123939 Hs.182997 ESTs 0.7 4.1

131961 AA129782 Hs.3576 Homo sapiens mRNA full length insert cDN 0.9 4.8

131964 AW381148 Hs.198365 2,3-bisphosphoglycerate mutase 1.1 6.1

131974 AF208856 Hs.268122 hypothetical protein 1.3 3.9

131983 AF119665 Hs.184011 pyrophosphatase (inorganic) 3.3 6.9

131997 AF229181 Hs.136644 CS box-containing WD protein 0.9 5.2

132006 AW162336 Hs.3709 low molecular mass ubiquinone-binding pr 1.2 3.6

132063 BE277910 Hs.3833 3'-phosphoadenosine 5'-phosphosulfate sy 3.2 1.8

132065 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 1.2 3£

132071 AF217798 Hs.3850 LIS1-interacting protein NUDEL; endoolig 0.7 5.2

132079 AI701457 Hs^8694 ESTs 2 5.3 

132094 NM_016045 Hs^945 CGI-107 protein 1.2 43

132116 AW960474 Hs.40289 ESTs 3.1 3.1

132164 AI752235 Hs.41270 procollagen-lysine, 2-oxoglutarate 5-dio 1.8 3.7

132181 AW961231 Hs.16773 Homo sapiens clone TCCCIA00427 mRNA sequ 1.2 5

132208 AL031709 Hs.241575 hypothetical protein CAB56184 1.4 4.2

132258 AA306325 Hs.4311 SUMO-1 activating enzyme subunit 2 2 10.3

132303 BE177330 Hs.325093 Homo sapiens cDNA: FLJ21210 fis, clone C 1.2 4.1

132316 U28831 Hs.44566 KIAA1641 protein 5.9 1.6

132358 NM_003542 Hs.46423 H4 histone family, member G 5.8 1.5

132384 AA312135 Hs.46967 HSPC034 protein 2.1 9.3 

132397 AA021160 Hs.4750 hypothetical protein DKFZp564K0822 1.3 4.6

132413 AW361383 Hs.260116 metalloprotease 1 (pitrilysin family) 2 4.9

132442 AW970859 Hs.313503 ESTs 1.2 5

132534 BE388673 Hs.5086 hypothetical protein MGC10433 2 3.9

132540 BE396290 Hs.5097 synaptogyrin 2 1.4 5.1 

132554 AF065391 Hs.194718 zinc finger protein 265 1.2 4 

132575 AV660538 Hs.284162 60S ribosomal protein L30 isolog 3 1.7

132585 AF029760 Hs.179600 TAP binding protein (tapasin) 1.8 4.7

132602 AW606927 Hs.5306 hypothetical protein DKFZp586F1122 simil 1.6 4.9

132608 AA353044 Hs.5321 ARP3 (actin-related protein 3, yeast) ho 1.8 8.1

132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 4.2 2

132719 AI264357 Hs.55405 hypothetical protein MGC16212 1.1 5.3
132730 AK000868 Hs.5570 hypothetical protein FLJ10006 1.4 5.2

132765 BE222975 Hs.56205 insulin induced gene 1 1.1 5.8

132782 F07424 Hs.279840 zinc finger protein 222 1.3 3.7 

132793 AB020713 Hs.56966 KIAA0906 protein 2.3 63

132805 AW975748 Hs.5724 sclerostin 0.7 7.7

132863 BE268048 Hs.235494 RAB10, member RAS oncogene family 1.8 62

132894 D63209 Hs.5944 solute carrier family 11 (proton-coupled 1.5 20.8

132930 AA579258 Hs.6083 Homo sapiens cDNA: FLJ21028 fis, clone C 1 3.8

132932 AW118826 Hs.6093 Homo sapiens cDNA: FLJ22783 fis, clone K 0.7 5.4

132933 BE263252 Hs.6101 hypothetical protein lUX3C317d 1.6 4.1
132965 AI248173 Hs.191460 hypothetical protein MGC12936 1 4.2

132984 BE539199 Hs.62112 zinc finger protein 207 1.5 4.4

132990 X77343 Hs.334334 transcription factor AP-2 alpha (activat 13.9 0.8

132998 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t 0.6 4.6 

133002 AW499985 Hs.42915 ARP2 (actin-related protein 2, yeast) ho 1.5 11.1 

133011 NM_006379 Hs.171921 sema domain, immunoglobulin domain (Ig), 3.5 1

133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3361295, mRNA 1 4.5

133040 AW502761 Hs.30909 KIAA0430 gene product 0.9 5.5

133056 H12028 Hs.6396 jumping translocation breakpoint 17 5.3

133063 AI654133 Hs.30212 thyroid receptor interacting protein 15 0.6 4.9

133067 AK000708 Hs.169764 hypothetical protein FLJ20701 1.2 3.5

133080 AF089816 Hs.6454 chromosome 19 open reading frame 3 1.2 17.5

133110 AA808177 Hs.65228 ESTs 0.9 5.1

133150 AV655783 Hs.661 Empirically selected from AFFX single pr 11 4.5

133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-r 1.5 4.8

133199 AF231981 Hs.250175 homolog of yeast long chain polyunsatura 5.5 55 

133203 AA464362 Hs.6748 hypothetical protein PP1665 1.2 3.7

133206 AB037773 Hs.6762 hypothetical protein 1.6 8.6

133221 W32474 Hs.301746 RAP2A, member of RAS oncogene family 2.4 4.8

133229 AL137480 Hs.6834 KIAA1014 protein 1 4.2

133241 AW796524 Hs.68644 Homo sapiens microsomal signal peptidase 1.3 3.9

133257 BE617892 Hs.6895 actin related protein 2/3 complex, subun 1.4 5.4

133271 Z48633 Hs.283742 H.sapiens mRNA for retrotransposon 3.1 0.7 

133273 N27672 Hs.69469 dendritic cell protein 2.5 6^ 
133287 AW797437 Hs.69771 B-factor, properdin 1.3

133291 BE297855 Hs.69855 NRAS-related gene 1.4 

133292 AA304961 Hs.699 peptidylprolyl isomerase B (cyclophilin 2.2
133294 AJ001388 Hs.69997 zinc finger protein 238 1.5
133300 AF116666 Hs.70333 hypothetical protein MGC10753 1.4
133302 X04898 Hs.237658 apolipoprotein A-II 02
133308 U56979 Hs.250651 H factor 1 (complement) 0.6
133347 BE257758 Hs.71475 acid cluster protein 33 1.2
133370 AF245505 Hs.72157 DKFZP564I1922 protein 3.7
133404 AB007916 Hs.214646 KIAA0447 gene product 1.4
133408 AI738719 Hs.198427 hexokinase 2 0.9
133422 AB033061 Hs.73287 KIAA1235 protein 1.2
133442 AL137663 Hs.7378 Homo sapiens mRNA; cDNA DKFZp434G227 (fr

133448 M27749 Hs.268168 immunoglobulin lambda-like polypeptide 1 1.1 

133449 Ar=038962 Hs.7381 volt^e^tependent anion channel 3 0.7 
133501 AI962602 Hs.74284 hypothetical protein MGC2714 3.1 
133504 NM_004415 Hs74316 desmoplakin (DPI, DPIl) 4.3 
133506 BE562958 Hs74346 hypothetical protein MGC14353 1.8 
133532 D87452 Hs74579 KIAA0263 gene pnoduct 1.2 
133574 H97991 Hs.193313 Empiricalty selected fiomAFFX single pr 1.4 
133586 AI929645 Hs.225936 synapsinl 0.8 
133589 L37368 Hs75104 RNA-binding protein SI. serine-rich doma 2 
133591 Ai423369 Hs75111 protease, serine, 11 (IGF binding) 2.1 
133606 U10554 Hs75188 wee1+(S. pombe) homolog 3.3 
133617 BE244334 Hs75249 ADP-ribosylation factor-like 6 interacti 2.3 
133651 AI301740 Hs.173381 dlhydropyrimldinase^il(e2 0.8 
133660 H14843 Hs,303154 popeye protein 3 1 
133663 AJ006239 Hs.75438 quinoid dihydropterldine reductese 0.5 

' 133668 L77964 Hs.271980 mitogen-adrvated protein kinase 6 1.1 

133671 AW503116 Hs.301819 zinc finger protein 146 1.8 

133681 AI352558 Hs75544 tyrosine 3-fnonoQxygenase/tryptophan Smo 1.5 

133694 W17187.comp Hs.232400 heterogeneous nuclear ribonudeoproteln 2 

133708 Ai018656 . Hs75667 synaptophysin 0.6 

133737 AW001130 Hs75824 KIAA0174 gene product 1.2 

133743 AI929587 Hs75847 CREBBPyEPSOO friWbltoiy protein 1 1^ 

133750 BE410769 Hs75873 zyxin 1.2 

133755 IVi62194 Hs75929 cadherin11.type2,OB^dherin(osteob 3.2 

133776 BE268649 Hs.177766 ADP-ribosyltransfe^ase(NAD-^^ poly(ADP- 2.1 

133799 W24087 Hs76285 DKFZP564B1 67 protein 1.9 

133800 AF075337 Hs76293 thymosin, beta 10 2.6 
133802 AW239400 Hs76297 G protein-coupled receptor kinase 6 1 
133806 D25969 Hs76325 step II splldng factor SLU7 0.5 
133817 AW578716 Hs.7644 HI histone family, member 2 1.5 
133829 AW630088 Hs.76550 Homo sapiens mRr^cONADKFZp564B1264(f 
133841 AA345824 Hs.76688 carboxylesterase 1 (monocyte/macrophage 0.3 
133845 AA147026 Hs.76704 ESTs 5.5 
133863 AI815523 Hs76930 synudein, alpha (non A4 component of am 0.6 
133887 X07767 Hs77271 protein kinase, cAIVtP-dependent calalytl 1 
133892 AW859528 Hs,301497 arginyltransferase 1 0.9 

133913 AU076964 Hs.7753 calumenin 2.8 

133914 AI458213 Hs.77542 ESTs 1.8 
133917 Ali)31177 Hs.7756 prateasome (prosome. macrapain) 26S subu 1.5 
133947 L41066 Hs.77810 nudear fedor of activated T-cells. cyt 1.5 
133985 1^54968 Hs,184050 v-Kl-ras2 Wrsten rat sarcoma 2 viraJ on 0.9 
133987 L15409 Hs.174007 von Hippekindau syndrome 2.3 

133989 AL040328 Hs.78202 SWI/SNF related, matrix assodated. adi 3.3 

133990 R48316 Hs.7822 Homo sapiens mRNA;cDNADKFZp564C121 6 {f 
134029 BE150882 Hs.143601 hypothetical protein hCLA-iso 1 
134040 NM_003470 Hs.78683 ubiqultinspedfic protease 7 (herpes vi 17 
134042 AI027881 Hs.7869 lysosomal 1 
134049 AF117236 Hs.78825 matrinS 1.2 
134095 NM_004354 Hs.79069 cyclInG2 27 
134098 BE513171 Hs.79086 mitochondria! ribosomal protein L3 3.3 

.134207 Z43039 Hs.170198 KIAA0009 gene produd 1.3 

134210 AF035606 Hs.80019 programmed cell death 6 17 

134218 U77736 Hs,80205 pinv2 oncogene 0.8 

134270 X68194 Hs.80919 synaptophysin-llke protein 1.4 
302665 Rgg693 Hs.224410
302892 AW176909 Hs.42346 
302963 AW673106 Hs.151945 
303131 AW081061 Hs.103180 
5 303150 AAB87146 Hs.8217 
310125 AA147979 Hs.285005 
312662 AA233808 Hs.286241 
319429 BE616412 Hs.286218 
320591 AA054761 Hs.169149 

10 406779 AA412048 Hs.279574 
410691 AW239226 Hs.65450 
410763 AF279145 Hs.8956 
415738 BE539367 Hs.295953 
420186 NIVL015925 Hs.95697 

15 422055 NIVL014320 Hs.111029 
425815 R94023 HsJ37531 
426218 AF119043 Hs.168005 
427397 AI929685 Hs.177656 
427466 AA523543 Hs7678 

20 427505 AA361562 Hs.178761 
427723 A1355260 Hs^79789 
428673 AW601325 Hs.324278 
430219 X99209 Hs.235887 
430450 R23553 Hs.241489 

25 432866 BE395875 Hs.279609 
433423 BE407127 Hs.8997 
437562 AB001636 Hs.5683 
437667 BE616412 Hs^86218 
437754 R60366 Hs.5822 

30 440252 BE513940 Hs.6101 
441471 AL042986 Hs.7857 
448292 BE281316 Hs.47334 
449404 H51068 Hs.23581 
449964 AW001741 Hs.273193 

35 451389 N73222 Hs579009 
452685 AI634651 Hs.30250 
RC_H15847_s 
RC_W84712 
X14008_ma1J 

40 RC_H86543J 
H07011 

RCJ^164586_s 

RC_AA070485 

RC_H98714_s 
45 RCJVV406145J 

AA458584 

AA031548 

X02761 

RC.AA487193 
50 R25326 

RC.AA393805 

RC_AA449333 

RC_AA28768Ls 

RC_AA490864 
55 RC_C14243J 

R21443 

RC.AA251902 

M2112L8 

C00038_s 
60 Y00503 

RC_R27006J 

RCJ^4168B6 

RC_AA460450 

RC_AA488433 
65 RCJ^78400J 

U28831 

RC_AA199588 



Homo sapiens CDNAFLJ12843 lis. done NT 3.6 

caldneurin4}lnding protein catsardn-l 3.3 

mitochondrial nbosomai protein Ij43 0.9 

DC2 protein 3 

stromal antigen 2 6.2 

mitDchondrial import receptor Tom22 1 .2 

protein kinase. cAMP-dependent regulato 1 

junctional adhesion molecule 1 1 .5 

karyopherln alpha 1 (importln alpha 5) 1 2 

CG1^9 protein; cell death-regulatory pr 1 .3 

retlculon4 1.2 

hypothetical protein FU21776 2 

ESTs. Weakly similar to AF220049 1 uncha 1 .3 

Hver-spedfic bH LH-2lp transcription fa 1 .5 

putative heme-binding protein 2 

ESTs. Moderately sinflar to 138022 hypot 1 .7 

Homo sapiens cDNAFLJ13372r6. done PL 33 

calmodulin 1 (phosphorytase kinase, delt 1 .3 

cellular retinoic acid-binding protein 1 1 .1 

26S proteasome^assodatBd padi homolog 3.2 

hlstone deacetylase 3 2.8 
Homo sapiens mRNA; cDNA DKFZp566M063 (fr 

HMT1 (hnRNP methyltransferase, S. cerevi 1 .8 
hypothetical protein 
mitochondrial cam'er homolog 2 
heat shocit 70kD protein 1A 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
junctional adhesion moleojie 1 
Homo sapiens cDNA: FLJ22120 fs» ctone H 
hypothetical protein MGC3178 
erythrocyte membrane proteii band 
hypotfietteal protein FLJ14495 
leptin receptor gen&ielated protein 
hypothetical protein FLJ10706 
matrix Gia protein 

v-maf muscutoaponeuroticfibrosaiT»ma (a 
peptidylprolyl isomerase B (cydophllln B) 
calumenin 

lysozyme (renal amyloidosis) 
ESTs 

ESTs; Weakly similar to SAS [H.sapiens] 



1.1 
1.5 
1.3 
1.6 
1J3 
2 

1.1 
0.5 
2.5 
1.1 
1.4 
4 

0.8 
1.8 
3.5 
0.9 
1.8 
1.8 
ESTs 
3.4 
1.6 
ESTs 
3.4 

cell diviston cyde 42 (GTP-bindIng protein; 25kD) 



Homo sapiens ck)ne 23967 
ESTs 

SRY (sex delemnining region Y)-box 4 



fibronectin 1 
secreted fr^ed-reiated protein 4 
Homo sapiens mRNA for putative vacuolar 
ESTs; Weakly similar to (deRine not 
ESTs 



ESTs; Highly simMar to heat shock factor 
ESTs; Highly similar to heat shock factor 
ESTs 

Homo sapiens lysophospholipase (LPL1) 
small inducible cytokine A5 (RANTES) 
ESTs 
keratin 19 
ESTs 

ESTs; Weakly simSar to predated using 
fibroblast growth factor receptor 2 (bacteria- 
ESTs; Weakly similar to deduced amino ackj 1.1 

Homo 

Human protein immuno-reactive with antl-PTH4.4 
Homo sapiens actin-related protein Arp3 (ARP3) 
3.6
4.7 
0.9 
1.1 
2.9 
ESTs 
1.4 
17 
1.6 
2.2 
0.9 
2.8 
3.1 
1.6 
3.1 
1.5 



3.6 

I. 6 
4.2 
17.3 
4 

6,5 
3.5 
47 
.5,6 
3.5 
13.9 
5.1 
3.9 
6.2 

II. 3 
3.6 
2.8 
4,7 
3.7 
2.5 
22 

1.1 5.2 
8.8 
5.6 
6.1 
7.6 
6.5 
3,5 
5.7 
6,2 
3.7 
4.9 
3.6 
3.5 
11.2 
5.6 
4.8 
4.6 
4,5 
6.6 
3.9 

6.2 0.8 
2.6 
3,5 

4.6 ' 3 
0.4 

3.1 3.9 
15.2 
4 
5 

8.4 
4.6 

1.3 4 
5 
5 

3.7 
3,8 
9,9 
4.8 
1.1 
3.7 
3.1 
3.7 
4 

sapiens HRtHFB2115 mRNA; partial cds 1.5 
0.6 

1,8 4.7 
AF006082


Homo sapiens actin-related protem Arp2 (ARP2) 


1.6 


10.9 


RC„H90899 


desmop!akln(DPI;DPII) 5.4 


5.5 




RC W95070 


desmoplakin (DPI; DPII) 5 


2.6 




RC^T90946J 


Human mRNA for K1AA263 gene; complete cds 


1.1 


3.9 


D87258 


protease; serine; 11 (IGF binding) 2.4 


3.5 




AA313414 s 


ESTs; Weakly similar to cDNA EST EMBLrT1 157 


1.5 


5.3 


RC.H73484 s 


ESTs; Weaklysimilartosimllarto Yeast 1.3 


6.3 




AFFX-HUMISGF3A/M97935_3 


2.3 


13.5 


AFFX-HUMRGE/M10098_5 1.1 


7.9 




AFFX-M27830 5 




0.5 


7.4 


AFFX-M27830.5 




0.6 


5.4 


RCJW063431J 


ESTs 


0.8 


4.1 


RC.T63769J 


fem'Gn; light potypeptkie 1 .1 


3.7 
TABLE 8A



Table 8A shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 8. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number
Accession: Genbank accession numbers
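
As an illustration only (it is not part of the original disclosure), a Table 8A row could be read into the three fields defined above roughly as follows. The names `ClusterRow` and `parse_cluster_row` are hypothetical, and the sketch assumes each row sits on one line with whitespace-separated fields: the Pkey first, the CAT number second, and every remaining token a Genbank accession.

```python
from typing import List, NamedTuple


class ClusterRow(NamedTuple):
    """One Table 8A row: probeset key, gene cluster number, accessions."""
    pkey: str
    cat_number: str
    accessions: List[str]


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into Pkey, CAT number, accessions.

    Assumes (hypothetically) that no field contains internal spaces.
    """
    tokens = line.split()
    if len(tokens) < 3:
        raise ValueError(f"expected Pkey, CAT number and accession(s): {line!r}")
    return ClusterRow(pkey=tokens[0], cat_number=tokens[1], accessions=tokens[2:])


row = parse_cluster_row("123526 genbank_AA608657 AA608657")
print(row.pkey, row.cat_number, row.accessions)
```

Rows whose accessions were fused or split by text extraction would need to be cleaned before such a parser gives meaningful results.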



Pkey CAT number Accessions

108469 116761_1 AA079487 AA128547 AA128291 AA079587 AA079600
125076 190299J AA973971 T8B817 AA253263 

114636 109698.1 AA075488AA 129081 AA074851 AA082BS2AA074732AA084908 AA084751 AA07e042AAl31172 AA085374 AA079519 
AA074510 AA1 13824 AA102437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 AA062834 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA0ai976 AA080957 AA0831 15 AA070942 AA085296 

123526 genbank_AA608657 AA608657

123533 genbank_AA608751 AA608751

125090 genbank_T91518 T91518

125154 genbank_W38419 W38419

118475 genbank_N66845 N66845

118505 genbank_N57343 N67343

101046 entrez_K01160 K01160

129982 22U67 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 XB5919 AF062190 
AF0621 77 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 21 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174D19 AF062285 L08924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 1^8052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 112098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 233899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 218318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 213321 S73957 
M26435 AF115130 AF115117 292896 U21254 292895 AF1 151 12 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF 1 74083 AF1 74089 AF015 133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
296957 AF021986 292898 296956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021 530 AB021534 296955 296954 233903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 298717 AF103143 AJ008250 L04323AF1 03321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
AF017458 AJ008207 AJ0a8183 AJ008196AJ008241 AJ00820a AF103210 AF06a668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NH-0Q2179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864AA485775 H64492AW404789AA487630AA715498AA295885 T27613T98113 

108470 genbank_AA079500 AA079500
101447 entrez_M21305 M21305
124447 genbank_N48000 N48000
101624 entrez_M55998 M55998

131791 221 260 X62111 S57984AJ 131056 247243 247235 AF062268 247237 AJ131058AF0621 30 AF062282X621 08 AA385989AA464794 
X69861 AW402964 M90808 Z9a735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U96288 
AF021989 AF062255 AF174061 873953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 Z98714 298738 
S75886 AJ0Oai75 AJQ08188 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
H45128 R86072 AA327555 A1660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 AI538691 AJ012264 AA664201 AI8a0450 AA327310 A1991^0 
AI833028 AW001210 AI956075 H30467 AA326915 H41943 AI749266 AI744441 AA327377 AW512326 Af735170 H01634 
AI587047AI571623AA3274B6AA327103AA327195AA326973T28143 



130064 22U64 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 



X57819 X57823 AW405604 AW404447 Z34914 AW406542 AA427726 AA604389 AW405608 AW405918 AW405117 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW37573a X87888 AF124166 AF054638 AJ241408 
U09902AW405284 M12025 AA360219L03631 L03629 AW327727AF1 94810 AF058076AF194686AF063774AF063737 
AF063755 AF083740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF194811 
AF194809 AF194814 AF1 94808 AF064505 AF064503 AF063765 AF063757 AF063752 AF083716 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF0S0134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF060122 AF194807 AF060135 AF064506 AF064504 AF063773 L26540 AF058072 AF060131 AF064500 
AF060136 AF064513 AF194683 AF19471 1 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916AJ241405AF194632AJ388659AF194625 Z74694 AF194588 AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 D84141 AW40575a AF001788 AF194580 D84143 U76684 AF194593 L03630 
X87892 X91134 U21249 246346 AJ132426 AF103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599 
AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF103710 AF103706 S62232 AF103645 AF103632 AF103647 AF1 03644 AF1 03640 
AF103692 AF093581 AF103620 AW405934 AI445389 AW383753 AA360256 AF099676 H21654 H39501 AI820828 H53689 
W26785 AW384496 AW407708 AA541 663 AA911602 AI821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193 
AA864302 H25133 D87023 J0301 1 M61771 D87017 AA526865 AA253450 



130232 18831_2 U29463 U08715 W24970 AA584303 T39581 AA155603 AA305043 AA429426 W05664 AA102382 AA482044 W24487 AA319080 
T88946 F10108 AA232161 AA243117 AA158937 AA100864 
109097 genbank_AA167512 AA167512



124842 217726,1 F 
103758 AA084874J.al 



R56485 R37248R59992 
I AA084874_f 
TABLE 9: Figure 9 from BRCA 001-2 US

Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title



Pkey ExAccn UnigeneID Unigene Title

100690 AA383256 Hs.1657 estrogen receptor 1
102211 BE314524 Hs.78776 putative transmembrane protein
103587 BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal syndrome I
105038 AW503733 Hs.9414 KIAA1488 protein
105500 AW602166 Hs^2399 CEGP1 protein
105990 AI690586 Hs^03 hypothetical protein FLJ22060
106155 AA425414 Hs.33287 nuclear factor I/B
106373 AW503807 Hs.21907 histone acetyltransferase
106414 BE568205 Hs^8827 mitogen-activated protein kinase kinase kinase 2
110009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal
111900 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence
114540 AI904232 Hs.75323 prohibitin
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4
117280 M18217 Hs.172129 Homo sapiens cDNA: FLJ21409 fis, clone Ca03924
119771 AI905687 Hs.2533 EST
121723 AA243499 Hs.104800 hypothetical protein FLJ10134
124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothetical protein YGLOSOw
131148 AW953575 Hs.303125 p53-induced protein PIGPC1
132371 AA235448 Hs.46677 PRO2000 protein
134169 AI690916 Hs.178137 transducer of ERBB2, 1
302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112
452410 AL133619 Hs.29383 Homo sapiens mRNA; cDNA DKFZp434
TABLE 10: Figure 10 from BRCA 001-3 PCT

Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal
breast tissue.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal body tissue

R2: Ratio of 90th percentile tumor to body

R3: Ratio of percentile body to tumor

R4: Ratio of tumor to normal breast tissue
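
Purely as a reading aid (not part of the original disclosure), the column definitions above can be mirrored in a small record type. The sketch below is hypothetical: the names `ExpressionRow` and `parse_expression_row` are invented for illustration, and it assumes each Table 10 row is written on a single line with whitespace-separated fields, that the last four tokens are R1 to R4, and that a UnigeneID, when present, starts with "Hs.".

```python
from typing import NamedTuple, Optional


class ExpressionRow(NamedTuple):
    """One Table 10 row, using the column names from the legend."""
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # some rows list no UnigeneID
    title: str
    r1: float
    r2: float
    r3: float
    r4: float


def parse_expression_row(line: str) -> ExpressionRow:
    """Parse 'Pkey ExAccn [UnigeneID] Title... R1 R2 R3 R4' (assumed layout)."""
    tokens = line.split()
    if len(tokens) < 7:
        raise ValueError(f"row too short: {line!r}")
    pkey, ex_accn = tokens[0], tokens[1]
    ratios = [float(t) for t in tokens[-4:]]  # last four tokens are R1..R4
    middle = tokens[2:-4]
    unigene_id = None
    if middle and middle[0].startswith("Hs."):
        unigene_id = middle[0]
        middle = middle[1:]
    return ExpressionRow(pkey, ex_accn, unigene_id, " ".join(middle), *ratios)


row = parse_expression_row(
    "100082 AA130080 Hs.4295 proteasome (prosome, macropain) 26S subu 4.2 152 36 12.2"
)
print(row.pkey, row.unigene_id, row.r1, row.r4)
```

Rows in which extraction damaged a numeric field (for example a value printed as `3^`) would raise a `ValueError` here rather than being silently reinterpreted.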



Pkey ExAccn UnigeneID Unigene Title R1 R2 R3 R4

100082 AA130080 Hs.4295 proteasome (prosome, macropain) 26S subu 4.2 152 36 12.2
100103 AA380887 Hs.5085 dolichyl-phosphate mannosyltransferase p 9.8 123 13 5
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 13.2 244 19 9.9
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 15.7 1030 66 5
100154 H60720 Hs.81892 KIAA0101 gene product 4.1 320 78 10.6
100157 D14661 Hs.119 Wilms' tumour 1-associating protein 47 119 26 3
100169 AL037228 Hs.82a43 D123 gene product 5.1 106 21 9.2
100203 BE242284 Hs.172199 adenylate cyclase 7 47 47 1 4.3
100210 D26361 Hs.3104 KIAA0042 gene product 47 47 4 07
100219 AW972300 Hs.118110 bone marrow stromal cell antigen 2 3^ 350 93 1.9
100234 D29677 Hs.3085 KIAA0054 gene product; Helicase 4.1 64 16 3
100248 NM_015156 Hs.78398 KIAA0071 protein 3.4 77 23 5.9
100252 NM_006207 Hs.170040 platelet-derived growth factor receptor- 4.5 45 4 4
100260 D38491 Hs.322478 KIAA0117 protein 5.9 59 1 2.6
100279 D42084 Hs.82007 KIAA0094 protein 3.5 96 28 1.3
100286 BE247550 Hs.86859 growth factor receptor-bound protein 7 3.1 306 98 1.5
100294 AA331881 Hs.75454 peroxiredoxin 3 12.8 128 1 117
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola 4.2 187 44 5.4
100365 AI878927 Hs.79284 mesoderm specific transcript (mouse) hom 4.5 129 29 3.1
100375 D80004 Hs.75909 KIAA0182 protein 3.5 78 23 4.8
100409 D86957 Hs.80712 KIAA0202 protein 10.2 102 1 4.8
100410 086^1 Hs.79299 lipoma HMGIC fusion partner-like 2 4 40 1 3.8
100414 NM_014735 Hs.82292 KIAA0215 gene product 3.2 32 2 2.9
100418 D86978 Hs.84790 KIAA0225 protein 3.6 36 7 3.2
100438 AA013061 Hs.91417 topoisomerase (DNA) II binding protein 5.6 76 14 2
100439 AA347720 Hs.122669 KIAA0264 protein 3.5 35 9 3.1
100448 AF234887 Hs.57652 cadherin, EGF LAG seven-pass G-type rece 5.5 145 27 2.2
100449 D87470 Hs.75400 KIAA0280 protein 3.4 34 1 1.2
100522 X51501 Hs.99949 prolactin-induced protein 227 760 34 1.4
100552 AA019521 Hs.301946 lysosomal 14.4 144 9 47
100643 NM_005032 Hs.4114 plastin 3 (T isoform) 4.1 259 63 1.9
100661 BE623001 Hs.132748 Homo sapiens ribosomal protein L39 mRNA, 3.3 116 36 2.2
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 8.5 85 1 3.2
100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 3 594 201 2.3
100745 BE207168 Hs.144630 nuclear receptor subfamily 2, group F, m 5 82 17 0.9
100774 J05581 Hs.89603 mucin 1, transmembrane 3.5 37 11 2.8
100783 AF078847 Hs.191356 general transcription factor IIH, polype 97 97 10 7.2
100821 M26460 gb:Homo sapiens (clone 104) retinoblasto 33 33 1 0.8
100864 BE563957 Hs.74861 activated RNA polymerase II transcriptio 37 477 130 3.1
100877 X80821 HsJ27973 KIAA0874 protein 6.3 63 4 57
100892 BE245294 Hs.180789 S164 protein 47 47 1 4.2
101038 BE297139 Hs.79411 replication protein A2 (32kD) 3.8 115 30 7.1
101046 K01160 NM_002122.Homo sapiens major histocompat 3.9 390 100 11.1
101079 BE264901 Hs.250502 carbonic anhydrase VIII 3.9 39 8 3.6
101084 AW409d34 Hs.75528 nucleolar GTPase 4.1 53 13 4
101104 AW862258 Hs.169266 neuropeptide Y receptor Y1 15^ 153 1 14.1
101185 NM_001621 Hs.170087 aryl hydrocarbon receptor 11.3 113 8 3.9
101188 L20320 Hs.184298 cyclin-dependent kinase 7 (homolog of Xe 3.1 118 38 2
101201 L22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uterin 8.2 396 48 0.9
101232 AU077288 Hs.242894 ADP-ribosylation factor-like 1 4 110 28 10.7
101275 BE545277 Hs.3273 Ts translation elongation factor, mitoch 4.2 50 12 4.4
101300 BE535511 HSJ4137 transmembrane trafficking protein 6.6 135 21 13.1
101396 BE267931 Hs.78996 proliferating cell nuclear antigen 6.4 249 39 22.4
101447 M21305 gb:Human alpha satellite and satellite 3 65 878 135 0.8
101448 NM_000424 Hs.195850 keratin 5 (epidermolysis bullosa simplex 4.8 622 130 0.7
101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 5.1 97 19 9.3
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activa 9.6 96 1 8.5
101484 AA053486 Hs.20315 interferon-induced protein with tetratri 11.2 112 8 5.9
101507 X16896 Hs.82112 interleukin 1 receptor, type 1 3.9 39 2 3.5
101621 BE391804 Hs.62661 guanylate binding protein 1, interferon- 3.6 36 1 2.6
101624 M55998 gb:Human alpha-1 collagen type 1 gene, 3 3.1 2898 923 2.2
101664 AA436989 Hs.121017 H2A histone family, member A 6.9 103 15 8.4
101684 M63256 Hs.75124 cerebellar degeneration-related protein 6.4 64 2 4.9
101724 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 9.4 94 1 0.3
101754 S70114 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi 8.9 89 5 8
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 3.6 824 227 1.4
101791 M83822 Hs.62354 cell division cycle 4-like 9 144 16 13
101794 M84605 Hs.957 putative opioid receptor, neuromedin K ( 3.3 36 11 2.4
101803 AW024390 Hs.155691 pre-B-cell leukemia transcription factor 5.4 180 34 15.9
101809 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 12 120 8 9
101839 AA446644 Hs.692 GA733-2 antigen; epithelial glycoprotein 3.1 353 116 2.8
101888 AL049610 Hs.95243 transcription elongation factor A (SII)- 7.3 73 1 5.3
101960 AL036287 Hs.194662 calponin 3, acidic 3.8 399 105 3.3
102009 BE245149 Hs.82643 protein tyrosine kinase 9 4.6 148 32 11.3
102095 U11313 Hs.75760 sterol carrier protein 2 9.5 95 4 8.8
102123 NM_001809 Hs.1594 centromere protein A (17kD) 4.2 42 7 3.4
102125 NM_006456 Hs.288215 sialyltransferase 9.3 93 4 3
102139 NM_004419 Hs.2128 dual specificity phosphatase 5 5.4 137 26 2.5
102162 AA450274 Hs.1592 CDC16 (cell division cycle 16, S. cerevi 4.6 151 33 2
102165 BE313280 Hs.159627 death associated protein 3 9.3 93 5 8
102193 AL036335 Hs.313 secreted phosphoprotein 1 (osteopontin, 45.7 457 1 39.7
102211 BE314524 Hs.78776 putative transmembrane protein 3.9 442 114 1.3
102221 NM_006769 Hs.3844 LIM domain only 4 4.9 49 1 3.6
102242 U27185 Hs.82547 retinoic acid receptor responder (tazaro 3.1 31 1 1.3
102258 NM_001546 Hs.34853 inhibitor of DNA binding 4, dominant neg 3.8 163 43 0.5
102302 AA306342 Hs.69171 protein kinase C-like 2 4.5 45 1 3.6
102304 AF015224 Hs.46452 mammaglobin 1 8.5 2058 243 1.4
102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 6.4 428 67 2.3
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 6.7 67 9 6.3
102407 AW602154 Hs.82143 E74-like factor 2 (ets domain transcript 5.3 53 1 4.8
102409 BE300330 Hs.118725 selenophosphate synthetase 2 3.3 111 34 7.5
102457 NM_001394 Hs.2359 dual specificity phosphatase 4 20.2 202 5 1.3
102544 NM_003937 Hs.169139 kynureninase (L-kynurenine hydrolase) 3.8 38 1 1.5
102567 U63830 Hs.146847 TRAF family member-associated NFKB activ 8.2 82 1 6.8
102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatida 4.1 41 1 3.3
102618 AL037672 Hs.81071 extracellular matrix protein 1 10.2
62
102638 U67319 Hs.9216 caspase 7, apoptosis-related cysteine pr 5 66 13 5.3
102663 NM_002270 Hs.168075 karyopherin (importin) beta 2 6.1 126 21 2.4
102669 U71207 Hs.29279 eyes absent (Drosophila) homolog 2 4.5 45 1 2.8
102742 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.1 41 1 2.4
1027M U85658 Hs.61796 transcription factor AP-2 gamma (activat 4.4 255 58 1.6
102805 U90304 Hs.25351 Iroquois homeobox protein 5 3.6 142 39 1.6
102813 BE242035 Hs.151461 embryonic ectoderm development 3.6 35 1 Z7
102823 D85390 Hs.5057 carboxypeptidase D 5.6 56 1 5.3
102825 BE262386 Hs.7137 clones 23667 and 23775 zinc finger prote 4.2 42 7 3.7
102899 AI815559 Hs.75730 signal recognition particle receptor ('d 3.2 58 18 5
NM_002275 Hs.80342 keratin 15 5.8 753 131 0.4
102927 BE512730 Hs.65114 keratin 18 3.1 815 266 1.7
102961 AL119505 Hs.198166 activating transcription factor 2 3.2 32 4 2.6
102968 AU076511 HSJ54672 methylene tetrahydrofolate dehydrogenase 5.7 251 44 6.6
103003 AI910275 Hs.1406 trefoil factor 1 (pS2) 5.6 1346 239 5.4
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103023 AW500470 Hs.1 17950 


multifunctional polypeptide similar to S 


5.8 


218 


38 


13 


103024 NNL002343HS.105938 


ladotransfenln 


3.7 


1421 


388 


1.9 


103036 M13509 Hs.83169 


matnx metaltoproleinase 1 (MMP1; inters 


3.1 


94 


30 


5.8 


103038 AA926960 Hs.334883 


CDC28 protein kinase 1 


3.5 


332 


94 


3.1 


103119 X63629 Hs.2877 


cadherin 3, type 1» P-cadherin (placenta 


4.8 


312 


65 


30.9 


103134 X55724 Hs2839 


Nome disease (pseudoglioma) 


5.2 


331 


64 


1.5 


103134 X65724 Hs.2839 


Nonle disease (pseudoglioma) 


4.9 


49 


5 


3.8 


103171 AW58305a Hs.234726 


serine (or cysteine) proteinase inhibito 


3.3 


1497 


458 


2.1 


103206 X72755 Hs.77367 


monokine induced by gamma interferon 


3.5 


796 


228 


3.2 


103208 AW411340 Hs.31314 


retlnoblastoma-bindlng protein 7 


5.6 


191 


34 


3.5 


103226 X75042 Hs.44313 


v-rel avian reticuloendotheiiosis viral 


4.1 


53 


13 


4.9 


103333 AA206186 Hs.79889 


monocyte to macropiiage differentiation-a 


3.4 


34 


8 


2.3 


103346 X87613 Hs.5464 


thyroid homione receptor ooacth/atfng pr 


3.9 


43 


11 


1 


103352 H09366 Hs.78853 


uiacil-DNA glyoosylase 


9.3 


93 


8 


8.2 


103375 NI\/L005982Hs^4416 


sine oculis homeobox (Drosophila) homdo 


9.7 


97 


1 


93 


103376 AU)36166 Hs.323378 


coated vesicle membrane protein 


6.3 


98 


16 


9.1 


103391 X94453 Hs.114366 


pyrTollne-5-carboxylate syntiietase (glut 


43 


77 


18 


7.2 


103438 AW175781 Hs.152720 


M-pliase phospiioprotein 6 


4.9 


153 


31 


2.4 


103453 AI878922 Hs.180139 


SIVfT3 (suppressor of mif two 3, yeast) tio 


4.9 


261 


53 


3.7 


103471 Y00815 Hs.75216 


protein tyrosine ptiospiiatase, receptor! 


3.5 


564 


162 


1.7 


103500 AW408009 Hs.22580 


alkylglycerone phosphate synthase 


3.9 


49 


13 


2.5 


103557 AL133415 Hs.297753 


vimentin 


7.5 


136 


18 


3.4 


103587 BE270266 Hs.82128 


5T4 oncofetal trophoblasl glycoprotein 


7.9 


79 


2 


6.9 


103605 BE409838 Hs.194657 


cadherin 1, type 1, E-cadherin (epithelf 


3.3 


745 


229 


1.8 


103606 AW403814 Hs.41714 


BCL2-associated athanogene 


3.2 


41 


13 


2.8 


103613 NNt000346Hs.2316 


SRY (sex detemiining region Y)-bQx9 (ca 


7.3 


73 


1 


5.2 


103658 NM.000088HS.172928 


collagen, type 1, alpha 1 


3.8 


1612 


429 


3.1 


103666 NM.003528Hs.2178 


H2B histone family, member Q 


3.2 


32 


5 


2.8 


103988 AA314389 Hs.42500 


ADP-rlbosylation factor-illce 5 


Z2 


32 


9 


2.7 


103990 AB033112 Hs.42179 


bromodomain and PHD finger containing. 3 


4.9 


49 


1 


4.2 


104052 NM-002407HS.97644 


mammaglobln 2 


72 


498 


69 


93 


104115 AF183810 Hs.26102 


opposite strand to trtchortilnophatangeal 


29 


290 


1 


26.8 


104129 H63349 Hs.98806 


hypothetical protein 


3.7 


37 


7 


2.1 


104146 AW880614 Hs.146381 


RNA binding motif protein, X chromosome 


5.2 


52 


1 


43 


104147 BE081342 Hs.283037 


HSPC039 protein 


8 


84 


.11 


63 


104209 AB012113 Hs.16530 


small inducible cytokine subfamily A (Cy 


5.8 


58 


1 


3.2 


104239 AB002367 Hs.21355 


doubtecorlin and CaM kinase-like 1 


6.4 


64 


8 


3 


104278 AW583693 Hs.109253 


N-temiinal acetyltransferase complex ard 


4.7 


229 


49 


7.9 


104309 Aj337300 Hs.284123 


hypothetical protein MGC4604 


3.2 


32 


7 


2.4 


104394 AA129551 Hs.172129 


Homo sapiens cDNA: FLJ21409 fis, done C 


5.3 


144 


27 


13.1 


104432 X51501 Hs.99949 


prolactin-induced protein 


6.9 


1494 


218 


13 


104558 R56678 Hs.88959 


hypothetical protein MGC4816 


7.7 


77 


8 


6.9 


104567 AA040620 Hs.5672 


hypothetical protein AF140225 


3.7 


37 


5 


2.5 


104590 AW373062 Hs.83623 


nuclear receptor subfamily 1, group I, m 


6.1 


493 


81 


0.7 


104602 H47610 


gb:yp75f03.s1 Soares fetal liver spleen 


3.8 


38 


4 


1.2 


104613 AF123303 Hs.24713 


hypothetical protein 


4.8 


231 


49 


7.3 


104633 H00820 Hs.30977 


ESTs, Weakly similar to B34087 hypotheti 


3.4 


154 


46 


3 


104636 R82252 Hs.106106 


protein kinase (cAMP-dependent, catalyti 


5 


468 


94 


4.7 


104660 BE298665 Hs.14846 


Homo sapiens mRNA; cDNA DKFZp564D016 (fr 3.8 


82 


22 


3.1 


104667 AI239923 Hs.30098 


ESTs 


145 


149 


1 


6.4 


104766 BE244072 Hs.20815 


macrophage erythroblast attacher 


6.3 


165 


26 


Z2 


104787 AA027317 


gb:ze97d11.s1 Soares_fetal_heart_NbHH19W 


3.8 


4u 


•4 i 
M 


0.0 


104804 AI858702 Hs.31803 


ESTs, Weakly similar to N-WASP [H.sapien 


7.7 


77 


1 


5.1 


104807 AI139058 Hs.125790 


leucine-rich repeat-containing 2 


7 


70 


1 


6.5 


104846 AI250789 Hs.32478 


ESTs 


4.7 


201 


43 


4.5 


104896 AW015318 Hs.23165 


ESTs 


7.4 


74 


1 


6 

33 


104919 AA026880 Hs.25252 


prolactin receptor 


3.9 


280 


72 


104926 BE298808 Hs.33363 


DKFZP434N093 protein 


4.2 


136 


32 


4 

A2 


104943 AF072873 Hs.114218 


frizzled (Drosophila) homolog 6 


16.2 


162 


1 


104968 AI249SQ2 Hs.29669 


ESTs 


3.8 


38 


1 


2.4 


104977 AI392640 Hs.18272 


amino acid transporter system A1 


3.2 


522 


165 


1.9 


104997 AA121686 Hs.10592 


ESTs 


3.2 


32 


4 


2.9 


105029 AI122691 Hs.13268 


ESTs 


3.7 


157 


43 


3.6 


105038 AW503733 Hs.9414 


KIAA1488 protein 


5.5 


55 


1 


5.2 


105041 AB037716 Hs.56204 


KIAA1295 protein 


10.3 


103 


1 


3.9 


105086 AA148710 Hs.79914 


lumican 


6.6 


66 


1 


5.4 


105088 H58589 HS^5156 


Homo sapiens cDNA FLJ11027 fis, clone PL 


3.1 


31 


1 


2.5 
methyl-CpG binding domain protein 4 


4.2 


90 


22 


2.0 


10o1d2 AL1o3033 Hs.40o4 


KIAA1025 protein 


0 


60 


b 


4.0 


105167 AW0I2147 ns.3205o 


Homo sapiens C1orf19 mRNA, partial cds 


3.8 


38 


2 


3.2 


10o17o AA3i3o25 Hs.21941 


AD030 protein 


9.3 


436 


47 


c 0 

5.8 


105195 AA97509D Hs. 19522 


hypothetical protein PRO2849 


5.7 


57 


0 




105200 AA328102 Hs.24641 


cytoskeleton associated protein 2 


4.5 


46 


1 


3.6 


105248 AW952479 Hs.22626 


tropomodulin 3 (ubiquitous) 


4.3 


43 


1 


3.9 


105252 AB02^70 Hs.9728 


ALEX1 protein 


8 


80 


6 


7.3 


105253 AW997484 Hs.5003 


KIAA0456 protein 


3.9 


39 


6 


3.2 


105280 AA894638 Hs.14600 


ESTs 


3.5 


35 


7 


2.7 


105288 Ng9673 Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ 


4.5 


45 


10 


0.5 


105309 AW)00796 ns.4104 


hypothetical protein 


3.8 


93 


25 


7.5 


105329 AA234561 Hs.22862 


ESTs 


2.8 


131 


47 


3.9 


105344 AF151073 Hs.8645 


hypothetical protein 


3.9 


79 


20 


6.5 


105376 AW994032 Hs.8768 


hypothetical protein FLJ10849 


5.1 


181 


35 


15.8 


105386 AW500718 Hs.8115 


Homo sapiens, clone MGC:16169, mRNA, com 


4.1 


41 


2 


3.3 


105400 AF198620 Hs.65648 


RNA binding motif protein 8A 


6.2 


62 


6 


5.6 


105426 W20027 Hs.23439 


ESTs 


3.3 


206 


63 


2.2 


105483 AL137257 Hs.23458 


Homo sapiens cDNA: FLJ23015 fis, clone L 


3.2 


466 


146 


8.4 


105496 AL117441 Hs.301997 


hypothetical protein FLJ13033 


16.6 


166 


8 


12.7 


105500 AW602166 Hs.222399 


CEGP1 protein 


25.4 


508 


20 


3 


105508 AA173942 Hs.326416 


Homo sapiens mRNA; cDNA DKFZp564H1916 (f 9 


117 


13 


10.6 


105511 AB037829 Hs.3862 


regulator of nonsense transcripts 2; DKF 


3.2 


32 


6 


1.5 


105516 AK001269 Hs.30738 


hypothetical protein FLJ10407 


8.3 


83 


3 


1.8 


105539 Ad040684 Hs.109694 


KIAA1451 protein 


3.5 


73 


21 


1.0 


105564 BE616694 Hs.288042 


hypothetical protein FLJ14299 


5.8 


336 


58 


2 


105610 AA2B0072 Hs.99872 


fetal Alzheimer antigen 


3.2 


32 
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105516 R35343 Hs.24968 


Human DNA sequence from clone RP1-233616 4.8 


79 


17 


5.2 


105627 AA281279 Hs.23317 


hypothetical protein FLJ14681 
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19 


1.7 


105640 AA001021 Hs.6685 


thyroid hormone receptor interactor 8 
ESTs 
histone deacetylase 3 
64 
upstream binding protein 1 (LBP-1a) 
Homo sapiens cDNA FLJ11918 fis, clone HE 
nuclear receptor coactivator 2 
34 
sema domain, immunoglobulin domain (Ig), 
chemokine-like factor, alternatively spl 


5.4 
ESTs 
hypothetical protein FLJ20160 
hypothetical protein MGC4643 
Homo sapiens cDNA FLJ13618 fis, clone PL 
ESTs, Moderately similar to CA1C RAT Ca 
ESTs 
secreted modular calcium-binding protein 
hypothetical protein FLJ13612 


4.3 


772 


179 


4 7 
1.7 


105864 A 1040775 ns.2o33^ 


Homo sapiens cDNA: FLJ21560 fis, clone C 
GCN5 (general control of amino-acid synt 
hypothetical protein FLJ10846 
34 
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105886 AK001735 Hs^83 


UDP-glucose:glycoprotein glucosyltransfe 
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45 
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13 


105906 N25986 Hs.22380 


ESTs 
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34 
106012 AI240665 Hs.8895 


ESTs 


21.2 


212 
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17.4 


106020 AA043039 Hs.7870 


hypothetical protein 
47 
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4.4 


106024 AL122072 Hs.103804 


heterogeneous nuclear ribonucleoprotein 


4.4 


174 


40 
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106034 AW952005 Hs.14928 


hypothetical protein FLJ12903 
ESTs 
F-box only protein 22 


3.4 
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35 
106060 NM_001329 Hs.171391 


C-terminal binding protein 2 


3.6 
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4.6 


105070 T74445 Hs.5957 


Homo sapiens clone 24416 mRNA sequence 
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103 


6.9 


106095 AF115402 Hs.11713 


E74-like factor 5 (ets domain transcript 


26.3 


356 


14 
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106096 AW379378 Hs.170121 


protein tyrosine phosphatase, receptor t 
hypothetical protein FLJ13352 
108459 AA789081 Hs.4029 
106503 AB033042 Hs.59679 
106508 AI205785 Hs.30348 
106565 NM_014892 Hs.227602 
106586 AA243837 Hs.57787 
106589 AK000933 Hs.28661 
106596 AA452379 Hs.293552 
106611 R49131 Hs.26267 
106628 AW188205 Hs.12311 
106650 AL049951 Hs.22370 
106683 BE296396 Hs.14512 
106698 N28524 Hs.29403 
106710 N38902 Hs.334437 
106717 AA600357 Hs.239489 
106747 NM_007118 Hs.171957 
106834 AL044182 Hs.28070 
106846 AB037744 Hs.34892 
106868 BE185536 Hs.301183 
106882 AA149537 Hs.26994 
106893 AA835868 Hs.25253 
106895 AK001826 Hs.25245 
106897 AF039023 Hs.167496 
106916 AA134329 Hs.24170 
106962 AI868648 Hs.22315 
106968 AF216751 Hs.26813 
106990 AA280722 Hs.24758 
107008 AL157479 Hs.23740 
107014 AA598820 
107032 AV650537 Hs.247309 
107056 AW401864 Hs.18720 
107071 AW385224 Hs.35198 
107080 AL122043 Hs.19221 
107102 AB037765 Hs.30652 
107109 AA249096 Hs.32793 
107136 AV661958 Hs.8207 
107151 AW378065 Hs.8687 
107217 AL080235 Hs.35861 
107222 BE172058 Hs.82689 
107240 AI290284 Hs.159872 
107248 AW263124 Hs.315111 
107295 AA186629 Hs.80120 
107299 BE277457 Hs.30661 
107316 T63174 Hs.193700 
107318 T74445 Hs.5957 
107485 AL042613 Hs.262476 
107612 AI498986 Hs.60090 
107638 AI580492 Hs.42743 
107727 AA149707 Hs.173091 
107859 AW732573 Hs.47584 
107876 AW372451 Hs.61184 
107884 AA054949 Hs.61307 
107886 AA025782 Hs.61284 
107908 AF087999 Hs.42826 
107922 BE153855 Hs.61460 



nuclear factor I/B 9.9 

KIAA1323 protein 6.7 

Homo sapiens clone 015h12 My015 protein 3.3 

KIAA1453 protein 3.8 

hypothetical protein FLJ20550 3.3 

hypothetical protein FLJ10955 4.5 

ESTs 7 

KIAA0406 gene product 3.2 

ESTs 3.8 
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 16 

Homo sapiens cDNA: FLJ21487 fis, clone C 4.9 

Homo sapiens clone 25142 mRNA sequence 3.1 

hypothetical protein FLJ20303 3.1 

glioma-amplified sequence-41 3.1 

cofactor required for Sp1 transcriptiona 5.5 

ESTs 4.4 

KIAA1116 protein 7.4 

ESTs 15.2 

Homo sapiens cDNA FLJ10071 fis, clone HE 3.8 

ESTs, Moderately similar to ALU7_HUMAN A 4.9 

ATP-dependant interferon response protei 5.8 

Homo sapiens clone 23570 mRNA sequence 5.3 
Homo sapiens mRNA; cDNA DKFZp564O0122 (f 5.4 

DIPS protein 3.6 

hypothetical protein FLJ22060 5.7 

hypothetical protein MGC4248 4.4 

TIA1 cytotoxic granule-associated RNA-bi 4.3 

triple functional domain (PTPRF interact 4.6 

KIAA0753 gene product 3.5 

KIAA1323 protein 5.4 

molecule possessing ankyrin repeats indu 3.3 

hypothetical protein FLJ20477 3.8 

mannosidase, alpha, class 1A, member 1 4.3 

hypothetical protein FLJ11269 3.6 

RAN binding protein 6 4.5 

Homo sapiens, clone IMAGE:3685398, mRNA, 5.7 

ESTs 3.5 

CDA14 5.5 

ESTs, Weakly similar to I38022 hypotheti 3.2 

KIAA1598 protein 5.1 

gb:ae36h12.s1 Gessler Wilms tumor Homo s 3.3 

succinate-CoA ligase, GDP-forming, beta 3.1 

programmed cell death 8 (apoptosis-induc 3.1 

ectonucleotide pyrophosphatase/phosphodi 3.1 

hypothetical protein DKFZp566G1424 3.9 

KIAA1344 protein 6.3 

ESTs 4.6 

GK001 protein 2.5 

ESTs 15.6 

DKFZP586E1621 protein 4.8 

tumor rejection antigen (gp96) 1 3.4 

ESTs 3.6 

nuclear receptor co-repressor/HDAC3 comp 5.4 

UDP-N-acetyl-alpha-D-galactosamine:polyp 4.6 
hypothetical protein MGC4606 12.5 
Homo sapiens mRNA; cDNA DKFZp586I0324 (f 3.2 

Homo sapiens clone 24416 mRNA sequence 3.5 

S-adenosylmethionine decarboxylase 1 5.8 

Homo sapiens cDNA FLJ13595 fis, clone PL 3.2 
hypothetical protein 4.4 
ubiquitin-like 3 3.5 
potassium voltage-gated channel, delayed 5.7 
CGI-79 protein 3.5 
ESTs 4.3 
ESTs 3.1 

ESTs 4.7 
Ig superfamily receptor LNIR 9 



107994 
108040 
108055 
108063 
108339 
108467 
108539 
108634 
108647 
108695 
108778 
108806 
108807 
108810 
108846 
108857 
108893 
108917 
109010 
109060 
109101 
109112 
109124 
109128 
109160 
109166 
109173 
109178 
109235 
109273 
109292 
109391 
109411 
109412 
109415 
109481 
109517 
109597 
109729 
109795 
109799 
109883 
109912 
109937 
109958 



110009 
110240 
110369 
110426 
110478 
110481 
110581 
110674 
110705 
110721 
110731 
110769 
110775 
110787 
110799 
110818 
110839 
110844 
110854 
110882 
110908 



AA036811 Hs.48469 
AL121031 Hs.159971 
AJ404672 Hs.334483 
BE548479 Hs.14838 
AW151340 Hs.51615 
AI478658 Hs.94631 
AA084677 Hs.54558 
AW022410 Hs.69507 
BE546947 Hs.44276 
AB029000 Hs.70823 
AF133123 Hs.90847 
AF070578 Hs.71168 
AI652236 Hs.49376 
AW295647 Hs.71331 
AL117452 Hs.44155 
AK001468 Hs.62180 
BE276891 Hs.194691 
AI380268 Hs.173648 
NM_007240 Hs.44229 
BE062109 Hs.241551 
AW608930 Hs.52184 
AW419196 Hs.257924 
AK000684 Hs.183887 
H89083 Hs.181915 
BE220601 Hs.301997 
AA219691 Hs.73625 
AA179962 Hs.73643 
AW976516 Hs.283707 
AI381800 Hs.300684 
AA375752 Hs.82719 
AW975746 Hs.188662 
AL096658 Hs.184245 
R98881 Hs.109655 
BE543313 Hs.209473 
U80736 Hs.110826 
AA878923 Hs,289a69 
AI631874 Hs.155140 
AA989362 Hs.293780 
F10024 Hs.268740 
AA173942 Hs.326416 
AW965076 Hs.180378 
R68827 Hs.95011 
AW390822 Hs.301528 
AI084066 Hs.20072 
AA001266 Hs.133521 
AI796320 Hs.10299 
BE075297 Hs.6614 
AI668594 Hs.176588 
AK000768 Hs.107872 
AI610702 Hs.28212 
H11236 Hs.31034 
AF075089 Hs.36823 
H61560 

AA071276 Hs.19469 
AB007902 Hs.32168 
H97578 Hs.31319 
NM_014899 Hs.188006 
BE000831 Hs.23837 
N22414 

AA831267 Hs.12244 
AI089660 Hs.323401 
AL157503 Hs.27552 
AF153330 Hs.30246 
AI740792 Hs.167531 
BE612992 Hs.27931 
AW963705 Hs.301183 
AI433165 Hs.9856 



LIM domains containing 1 4.5 

SWI/SNF related, matrix associated, acti 6.5 

hypothetical protein FLJ23571 7.4 

hypothetical protein FLJ10773 3.4 
ESTs, Weakly similar to ALU7_HUMAN ALU S 18.7 

brefeldin A-inhibited guanine nucleotide 3.8 

hypothetical protein FLJ22222 5.7 

ESTs 3.2 

homeo box C10 8.7 

KIAA1077 protein 3.7 

general transcription factor IIIC, polyp 3.7 

Homo sapiens clone 24674 mRNA sequence 3.4 

hypothetical protein FLJ20644 3.5 

hypothetical protein MGC5350 5.3 

DKFZP586G1517 protein 4.8 

anillin (Drosophila Scraps homolog), act 5.4 

retinoic acid induced 3 3.1 

ESTs, Weakly similar to Zinc-finger prot 3.3 

dual specificity phosphatase 12 3.4 

chloride channel, calcium activated, fam 3.1 

hypothetical protein FLJ20618 3.4 

hypothetical protein FLJ13782 4.1 

hypothetical protein FLJ22104 3.3 

ESTs 4 

hypothetical protein FLJ13033 3.8 

RAB6 interacting, kinesin-like (rabkines 8.8 

EST 3.2 

Homo sapiens cDNA: FLJ21354 fis, clone C 3.2 

calcitonin gene-related peptide-receptor 4.9 
Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9 

KIAA1702 protein 7.1 

KIAA0929 protein Msx2 interacting nuclea 6.9 

sex comb on midleg (Drosophila)-like 1 3.3 

hypothetical protein FLJ10520 4.2 

trinucleotide repeat containing 9 12.3 

hypothetical protein FLJ21016 3.2 

casein kinase 2, alpha 1 polypeptide 8.3 

ESTs 5.9 

ESTs 3.2 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 5.9 

hypothetical protein 669 - 5 

syntrophin, beta 1 (dystrophin-associate 37 

L-kynurenine/alpha-aminoadipate aminotra 142 

myosin regulatory light chain interactin 4.1 

ESTs 4.2 

Homo sapiens cDNA FLJ13545 fis, clone PL 3.2 

ESTs, Weakly similar to A43932 mucin 2 p 6.3 
ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.6 

hypothetical protein FLJ20761 3.8 
ESTs, Weakly similar to TRHY_HUMAN TRICH 67 

peroxisomal biogenesis factor 11A 37 

ESTs 3.6 

gb:yr22g03.s1 Soares fetal liver spleen 3.3 

KIAA0859 protein 3.5 

KIAA0442 protein 3.6 

ESTs 4.4 

KIAA0878 protein 3.3 

Homo sapiens cDNA FLJ11812 fis, clone HE 13.5 

gb:yw39a07.s1 Weizmann Olfactory Epithel 5.4 

hypothetical protein FLJ20097 4.7 

dpy-30-like protein 5 
Homo sapiens mRNA; cDNA DKFZp586N2424 (f 3.1 

solute carrier family 19 (thiamine trans 8.4 

methylcrotonoyl-Coenzyme A carboxylase 2 10.5 

hypothetical protein FLJ10607 similar to 7.9 

molecule possessing ankyrin repeats indu 3.9 

ESTs 3.1 
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37 

34 

35 

53 

96 
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33 

34 

31 

71 
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33 
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32 

32 
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114 

71 

69 

39 

56 

123 

286 

83 

59 

41 

208 

50 

37 

142 

41 

58 

136 

693 

913 

38 

78 

37 

36 

33 

35 

282 

103 

138 

135 

54 

47 

50 

31 

84 

105 

79 

353 

31 



1 
2 
8 
1 
1 
1 
1 
5 

29 

168 

1 

1 

1 

1 

20 
1 

170 

5 

1 

8 

21 

82 

1 

7 

62 
23 
1 

10 

25 

39 

1 

5 

12 

14 

1 

91 

8 

10 

13 

36 

5 

4 

1 

7 

14 

43 

110 

199 

7 

12 

1 

10 
1 

8 

79 

24 

42 

1 

1 

4 

1 

1 

1 

4 

1 

90 
1 



3.8 

6 

6 

2.3 
17 
3.2 
4.9 
17 
57 
3.8 
3.2 
2.8 
32 
2.8 
6.5 
4 

4.1 
1.7 
2.6 
2 

2.4 

3.4 

2.9 

1.1 

3.8 

16.1 

2.2 

2.9 

10.4 

9.9 

6.5 

62 

1.5 
2.2 

11.3 
5.7 
1.9 
42 
3.3 
1.8 
4.1 
2 

9.5 
1.7 
0.8 
3.6 
72 
2.9 
2.8 
3 

2.1 
25 
1.8 
1.9 
1.7 
3.8 
3.6 
5.1 
3.7 
42 
4.3 
2.7 
5.3 
7.1 
62 
12 
1.3 
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110915 BE0922d5 Hs^724 
110930 BE242e91 Hs.14947 
110970 Y19062 Hs.96870 
111084 H44186 Hs.15456 
1111K N63823 Hs.269115 
111132 AB037807 Hs.83293 
111164 N46180 Hs.122489 
111179 AK000136 Hs.10760 
111184 AI815486 Hs.243901 
111190 AK002055 Hs.151046 
111221 AB037782 Hs.15119 
111223 AA852773 Hs.334838 
111229 AW389845 Hs.110855 
111234 AA902656 Hs.21943 
111241 AA345644 Hs.288880 
111345 AW263155 Hs.14559 
111353 W20090 Hs.6616 
111357 BE314949 Hs.87128 
111378 AW160993 Hs.326292 
111389 AK000987 Hs.169111 
111540 U82670 Hs.9786 
111806 BE071382 Hs.279008 
111884 AW502285 Hs.127236 
111923 BE383234 Hs.25925 
111929 AF027208 Hs.112360 
1119« R40576 Hs.21590 
111987 NM_015310 Hs.6763 
112092 R44538 
112134 R41823 Hs.7413 

112197 NM.003655Hs^637 

112198 AI432672 Hs^88539 
112244 AB029000 Hs.70823 
112253 R51818 

112269 R53734 Hs.25978 
112275 AW972635 Hs.301904 
112280 AA863360 Hs.26040 
112305 AK000914 Hs.26244 
112483 AW969785 Hs.285885 
112513 R68425 Hs.13809 
112571 AA412205 Hs.140996 
112971 242387 Hs.83883 
113023 AL134324 Hs.7312 
113047 AI571940 Hs.7549 
113073 N39342 Hs.103042 
113083 AA283057 Hs.266957 
113287 T68847 Hs.194040 
113296 AW449560 Hs.89576 
113523 AI791905 Hs.95549 
113604 AI075407 Hs.296083 
113617 AI869372 Hs.17207 
113702 T97307 
113783 AL359588 Hs.7041 
113791 AI269096 Hs.135578 
113794 T62849 Hs.11090 
113804 BE247683 Hs.14511 
113808 W44735 Hs.9286 
113847 NM_005032 Hs.4114 
113849 AA457211 Hs.8858 
113867 AW002834 Hs.24095 
113886 W76Q27 Hs.23920 
113923 AW953484 Hs.3849 
113936 W17056 Hs.83623 
113950 AI267652 Hs.30504 
114030 AI825386 Hs.164478 
114051 AB026436 Hs.177534 
114057 AF116653 Hs.34192 
114082 AK001612 Hs.26962 



hypothetical protein FLJ13187 20.9 

ESTs 3.4 

staufen (Drosophila, RNA-binding protein 3.5 

POZ domain containing 1 4.3 

ESTs, Moderately similar to Z195_HUMAN Z 5.4 

hypothetical protein 7.2 

Homo sapiens cDNA FLJ13289 fis, clone OV 7.7 

asporin (LRR class 1) 25.1 

Homo sapiens cDNA FLJ20738 fis, clone HE 3.9 

hypothetical protein FLJ11193 6.3 

KIAA1351 protein 3.7 

KIAA1866 protein 3.6 

ESTs 4^ 

NIF3 (Ngg1 interacting factor 3, S.pombe 33 

PAN2 protein 4^ 

hypothetical protein FLJ10540 4.3 

ESTs 4.1 

hypothetical protein FLJ23309 3.8 

hypothetical gene DKFZp434A1114 4.3 

oxidation resistance 1 3.4 

zinc finger protein 275 3.5 

hypothetical protein FLJ20170 3.5 

hypothetical protein FLJ12879 3.2 
Homo sapiens, clone MGC:15393, mRNA, com 6.2 

prominin (mouse)-like 1 8.1 

hypothetical protein DKFZp564O0523 4.2 

KIAA0942 protein 6.5 

gb:yg29c02.s1 Soares infant brain 1NIB H 3.3 

ESTs; calsyntenin-2 6.1 

ESTs 3.5 

hypothetical protein FLJ22191 3.5 

KIAA1077 protein 57 

gb:yg77h12.s1 Soares infant brain 1NIB H 4 

ESTs, Weakly similar to 2109260A B cell 37 

hypothetical protein FLJ12671 4.3 

ESTs, Weakly similar to fatty acid omega ZB 

hypothetical protein FLJ10052 3.5 

Homo sapiens cDNA FLJ11321 fis, clone PL 4.2 

hypothetical protein FLJ10648 4.7 

ESTs 4.8 

transmembrane, prostate androgen induced 4.5 

ESTs 3.2 

ESTs 9.6 

microtubule-associated protein 1B 9.1 
hypothetical protein FLJ14281 6.5 

ESTs, Weakly similar to I38022 hypotheti 3.5 

inner mitochondrial membrane peptidase 2 3.5 

hypothetical protein 7.6 

ESTs, Moderately similar to I54374 gene 3.1 

Homo sapiens cDNA FLJ11922 fis, clone HE 3.6 

gb:ye53h05.s1 Soares fetal liver spleen 12.3 

hypothetical protein DKFZp762B226 4.6 

chitobiase, di-N-acetyl- 3.6 

membrane-spanning 4-domains, subfamily A 3.3 

dual specificity phosphatase 11 (RNA/RNP 3.3 

Homo sapiens cDNA: FLJ21278 fis, clone C 5.1 

plastin 3 (T isoform) 3.2 

bromodomain adjacent to zinc finger doma 4.3 

ESTs 6.1 

hypothetical protein FLJ11105 4 

hypothetical protein FLJ22041 similar to 3.7 

nuclear receptor subfamily 1, group I, m 4.3 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 107 

hypothetical protein FLJ21939 similar to 4.4 

dual specificity phosphatase 10 4.5 



Homo sapiens PRO0823 mRNA, complete cds 3.5 
Homo sapiens cDNA FLJ10750 fis, clone NT 3.1 
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114124 W57554 H5.125019 lymphoid nuclear protein (LAF-4) mRNA 242 242 10 5.6 

114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 6.7 67 1 S2 

114162 AF155661 Hs.22265 pyruvate dehydrogenase phosphatase 3.8 73 19 1.8 

114196 AF017445 Hs.150926 fucose-1-phosphate guanylyltransferase 4.4 104 24 5.1 

114208 AL049466 Hs.7859 ESTs 5.7 57 1 4.9 

114239 AL137667 Hs.267445 Homo sapiens mRNA; cDNA DKFZp434B231 (fr 3.3 33 1 2.4 

114251 H15261 Hs.21948 ESTs 4.2 46 11 1.4 

114306 AF100143 Hs.6540 fibroblast growth factor 13 4.5 45 2 3 

114460 AF183810 Hs.26102 trichorhinophalangeal syndrome I 4.4 44 1 3 

114542 AW970128 Hs.91011 anterior gradient 2 (Xenepus laevis) hom 4.7 770 166 5.8 

114652 AI521936 Hs.107149 novel protein similar to archaeal, yeast 5.2 52 3 2.3 

114767 AI859855 Hs.154443 minichromosome maintenance deficient (S. 4.6 196 43 10 

114768 AF212848 Hs.182339 ets homologous factor 13.7 137 1 8.9 
114774 AV656017 Hs.184325 CGI-76 protein 3.3 168 51 7.3 

114798 AA159181 Hs.54900 serologically defined colon cancer antig 7.4 137 19 1.8 

114821 AI648602 Hs.55468 ESTs 4.7 57 12 47 

114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 9.1 91 1 7.6 

114918 BE165762 Hs.23518 hypothetical protein from BCRA2 region 10.1 111 11 10.2 

114940 BE092696 Hs.75928 ESTs 6.4 67 11 5 

114965 AI733881 Hs.72472 BMP-RIB 35.9 359 10 297 

114969 AW162998 Hs.24684 KIAA1376 protein 9.4 94 8 7.3 

114988 AA251089 gb:zs04f05.s1 NCI_CGAP_GCB1 Homo sapiens 11.5 115 1 6.9 

115004 AA329340 Hs.4867 mannosyl (alpha-1,3-)-glycoprotein beta- 4.2 42 9 1.1 

115054 AW265668 Hs.87729 hypothetical protein FLJ12428 5.1 51 1 4.2 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDN 4.5 290 65 37 

115140 NM_014158 Hs.279938 HSPC067 protein 4.8 48 1 4.4 

115142 AI623693 Hs.191533 ESTs 3.2 49 16 4.2 

115188 AK000219 Hs.88367 hypothetical protein FLJ20212 3.3 33 1 3 

115206 AW183695 Hs.186672 ESTs 5.8 58 1 5 

115221 AW365434 Hs.79741 hypothetical protein FLJ10116 5.5 343 62 2.5 

115262 AI422867 Hs.88594 ESTs 11.2 112 1 10.3 

115291 BE545072 Hs.122579 hypothetical protein FLJ10461 4.5 96 21 7.8 

115536 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 5.9 59 1 4.2 

115583 NM_012317 Hs.45231 leucine zipper, down-regulated in cancer 9.8 98 1 8.8 

115600 AA081395 Hs.42173 Homo sapiens cDNA FLJ10366 fis, clone NT 4.6 46 2 1.8 

115622 AI088691 Hs.208414 Homo sapiens mRNA; cDNA DKFZp564D0472 (f 4.4 44 7 1.1 

115645 N36110 Hs.305971 solute carrier family 2 (facilitated glu 3.2 372 115 2.1 

115674 AW992356 Hs.8364 Homo sapiens pyruvate dehydrogenase kina 10.2 506 50 2.8 

115575 W87707 Hs.82065 interleukin 6 signal transducer (gp130, 5.2 405 78 10.1 

115719 AW992405 Hs.59622 Homo sapiens, clone IMAGE:3507281, mRNA, 7.6 144 19 13.9 

115725 AW899053 Hs.76917 F-box only protein 8 3.1 58 19 2.5 

115764 AW582256 Hs.91011 anterior gradient 2 (Xenepus laevis) hom 5.7 368 65 28.5 

115821 AW338063 Hs.130965 zinc-finger protein ZBRK1 3.9 39 8 2.2 

115825 R50956 Hs.159993 gycosyltransferase 45 79 19 1.9 

115839 BE300266 Hs.28935 transducin-like enhancer of split 1, hom 5.8 58 1 4.4 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 6.2 62 1 5.4 

115892 AA291377 Hs.5Q831 ESTs 3.2 40 13 0.7 

115967 AI745379 Hs.42911 ESTs 8.4 101 12 87 

116093 AW673312 Hs.50848 hypothetical protein FLJ20331 3.6 36 1 2 

116097 AI198719 Hs.176376 ESTs 5.1 51 1 2 

116107 AL133916 Hs.172572 hypothetical protein FLJ20093 3.4 34 8 1 

116127 AF126743 Hs.279884 DNAJ domain-containing 3.5 35 8 3.3 

116129 AF189011 Hs.49163 putative ribonuclease III 4.5 45 9 3.4 

116204 AW861622 Hs.108646 Homo sapiens cDNA FLJ14934 fis, clone PL 5.2 52 4 3.9 

116226 AW976438 Hs.17428 RBPI-like protein 3.8 38 7 2.1 

116238 AV660717 Hs.47144 DKFZP586N0819 protein 5.1 198 39 17.9 

116250 N76712 Hs.44829 ESTs, Weakly similar to I38022 hypotheti 13.3 133 8 3.2 

116256 AA328153 Hs.88201 ESTs, Weakly similar to A Chain A, Cryst 3.3 106 33 9.8 

116298 AI955411 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PL 4.8 179 38 2.8 

116336 AL133033 Hs.4084 KIAA1025 protein 3.2 173 55 3 

116351 AL133523 Hs.82501 similar to mouse Xrn1/Dhm2 protein 3.7 37 1 1.8 

116355 N50174 Hs.46765 ESTs 3.9 39 10 0.6 

116379 AA448588 Hs.71252 hypothetical protein DKFZp761C169 5.6 106 19 9 

116429 AF191018 Hs.279923 putative nucleotide binding protein, est 3.6 256 72 3.7 

116450 AI654450 Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 3.1 119 39 2 

116461 AA313607 Hs.58633 Homo sapiens cDNA: FLJ22145 fis, clone H 5.5 315 58 3.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 3.4 496 144 1.6 
116507 AI418366 Hs.68501 


ESTs 


3.1 


31 


4 


1.9 


116579 AW888411 Hs.81915 


leukemia-associated phosphoprotein p18( 


3.3 


931 


279 


5.6 


116625 F01601 Hs.241567 


RNA binding motif, single stranded inter 


3.6 


36 


1 


1.9 


116674 AI768015 Hs.92127 


ESTs 


4.5 


96 


22 


6.9 


116680 AW902848 Hs.273829 


ESTs 


4.2 


42 


1 


2.7 


116710 F10577 Hs.306088 


v-crk avian sarcoma virus CT10 oncogene 


7.1 


71 


9 


6.9 


116724 AA741307 Hs.65641 


hypothetical protein FLJ20073 


4.3 


190 


44 


5.4 


116786 H25836 Hs.301527 


ESTs, Moderately similar to unknown [H. 


22.8 


228 


9 


12.4 


116787 AW362955 Hs.15641 


Homo sapiens cDNA FLJ14415 fis, clone HE 


4.9 


108 


22 


9 


116790 AW161357 Hs.101174 


microtubule-associated protein tau 


4.6 


163 


35 


7.3 


116844 H64938 Hs.337434 


ESTs, Weakly similar to A46010 X-linked 


6.9 


69 


10 


2.4 


117027 AW085208 Hs.130093 


ESTs 


4.8 


48 


1 


25 


117067 H91164 Hs.335797 


ESTs 


3.3 


33 


1 


2.3 


117129 H95785 Hs.167652 


ESTs, Highly similar to 1819485A CENP-E 


3.1 


38 


13 


1.7 


117147 AW901347 Hs.38592 


hypothetical protein FLJ23342 


4.8 


48 


1 


0.9 


117170 N25929 Hs.42500 


ADP-ribosylation factor-like 5 


3.1 


295 


96 


27.9 


117209 W03011 Hs.306881 


MSTP043 protein 


3.6 


41 


12 


2.8 


117280 M18217 Hs.172129 


Homo sapiens cDNA: FLJ21409 fis, clone C 


3.9 


322 


83 


4.4 


117367 AI041793 Hs.42502 


ESTs 


3.5 


72 


21 


1.3 


117412 N32536 Hs.42545 


solute carrier family 16 (monocarboxylic 


17.4 


174 


9 


6.9 


117475 N30205 Hs.93740 


ESTs, Weakly similar to 138022 hypotheti 


3.2 


35 


11 


0.7 


117634 AW341539 Hs.13323 


hypothetical protein FLJ22059 


5 


50 


1 


4.7 


117667 U59305 Hs.44708 


Ser-Thr protein kinase related to the my 


4.5 


211 


47 


5 


117852 AW877787 Hs.136102 


KIAA0853 protein 


4.6 


46 


1 


3.8 


117873 N49967 Hs.46624 


HSPC043 protein 


3.1 


31 


1 


2.7 


117924 AI521436 Hs.38891 


ESTs 


4.9 


49 


1 


4.4 


118138 AA374756 Hs.93560 


Homo sapiens mRNA for KIAA1771 protein. 


5 


50 


2 


3.1 


118449 AI813865 Hs.164478 


hypothetical protein FLJ21939 similar to 


3.6 


89 


25 


0.9 


118467 AF091434 Hs.43080 


platelet derived growth factor C 


3.2 


378 


117 


2.8 


118472 AL157545 Hs.42179 


bromodomain and PHD finger containing, 3 


14.5 


145 


1 


2.4 


118475 N66845 


gb:za46c11.s1 Soares fetal liver spleen 


3.1 


199 


64 


1 


118509 N22617 Hs.43228 


Homo sapiens cDNA FLJ11835 fis, clone HE 


6 


60 


5 


3.7 


118528 AI949952 Hs.49397 


ESTs 


3.3 


81 


25 


1.5 


118828 N79496 Hs.50824 


EST, Moderately similar to 154374 gene N 


3.4 


740 


217 


2.8 


118836 AW134482 Hs.173001 


hypothetical protein FLJ13964 


4.3 


162 


38 


12.1 


118854 T58283 Hs.10450 


Homo sapiens cDNA: FLJ22063 fis, clone H 


3.4 


118 


35 


2.3 


118873 AI824009 Hs.44577 


ESTs 


3.5 


35 


1 


2.9 


118888 AI191811 Hs.54629 


ESTs 


8.4 


84 


10 


0.8 


118901 AW292577 Hs.94445 


ESTs 


7.3 


73 


3 


5.4 


118981 N29309 Hs.39288 


ESTs 


5 


50 


5 


4.7 


118991 NM_016657 Hs.250696 


KDEL (Lys-Asp-Glu-Leu) endoplasmic retic 


3.7 


37 


6 


0.5 


119023 Ng8488 


gb:zb82h01.s1 Soares_senescent_fibroblas 


3.3 


36 


11 


0.6 


119088 R39261 Hs.90790 


Homo sapiens cDNA: FLJ22930 fis, clone K 


3.3 


167 


51 


2.6 


119126 R45175 Hs.117183 


ESTs 


5.3 


53 


6 


2.3 


119128 H09334 Hs.92482 


ESTs 


3.7 


37 


4 


3 


119271 AI061118 Hs.65328 


Fanconi anemia, complementation group F 


8.2 


82 


1 


6.4 


119298 NM_001241 Hs.155478 


cyclin T2 


4 


40 


4 


1.2 


119307 BE048061 Hs.37054 


ephrin-A3 


3.3 


571 


171 


2 


119367 T78324 Hs.250895 


ribosomal protein L34 


3.4 


34 


3 


2.4 


119427 AW474547 Hs.53565 


Homo sapiens PIG-M mRNA for mannosyltran 


4.6 


60 


13 


4.8 


119580 AL079310 Hs.92260 


high-mobility group protein 2-like 1 


8.1 


94 


12 


6.5 


119500 Ar0o8U33 nS.ioS^o 


coTS 


J.J 


OJ 


0 
0 




119638 NM_016122 Hs.56148 


NY-REN-58 antigen 


3.3 


33 


10 


0.5 


119676 AA243837 Hs.57787 


ESTs 


5.4 


54 


1 


4.1 


119717 AA918317 Hs.57987 


B-cell CLL/lymphoma 11B (zinc finger pro 


4.6 


46 


7 


0.8 


119771 AI905687 Hs.2533 


EST 


3.5 


2073 


595 


2.1 


119780 NM_016625 Hs.191381 


hypothetical protein 


4.4 


44 


1 


3.1 


119786 AL133396 Hs.121281 


prion protein 2 (dublet) 


3.4 


34 


1 


2.5 


119805 AJ223810 Hs.43213 


ESTs, Weakly similar to IEFS_HUMAN TRANS 


3.6 


36 


1 


2.9 


119859 AW245741 Hs.58461 


ESTs, Weakly similar to A35659 krueppel- 


5.2 


52 


6 


1.8 


119699 AI057404 Hs.58698 


ESTs 


3.7 


37 


4 


1.9 


119940 AL050097 Hs.272531 


DKFZP586B0319 protein 


6.9 


162 


24 


2.6 


119943 BE565849 Hs.14158 


copine III 


3.7 


590 


159 


3.8 


120132 W57554 Hs.125019 


lymphoid nuclear protein (LAF-4) mRNA 


6.9 


319 


47 


2.1 


120150 BE005771 Hs.153746 


hypothetical protein FLJ22490 


5.3 


53 


5 


0.9 


120215 AF109219 Hs.108787 


phosphatidylinositol glycan, class N 


3.2 


106 


34 


3.3 


120260 AK000061 Hs.101590 


hypothetical protein 


3.4 


34 


1 


1.7 
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120296 AW995911 Hs.299883 
120352 R06B59 Hs.193172 
120378 AA223249 Hs.285728 
120418 AW966893 Hs.26613 
120473 AA251973 Hs.269988 
120493 AW968080 Hs.152939 
120524 AA261852 Hs.192905 
120554 AA284447 Hs.271887 
120562 BE244580 Hs.302257 

120571 AB037744 Hs.34892 

120572 H39599 Hs.294008 
120588 AA703226 Hs.16193 
120649 AA687322 Hs.192843 
120656 AI952639 Hs.98267 
120713 AW449855 Hs.96557 

120821 Y19062 Hs.96870 

120822 AA347422 Hs.238040 
120915 AL135556 Hs.97104 
120922 AA481003 Hs.97128 
120977 AA398155 Hs.97600 
120999 AI972375 Hs.29626 
121125 AL042981 Hs.251278 
121176 AL121523 Hs.97774 
121202 AA970946 Hs.97794 
121429 AA406293 Hs.41167 
121448 AF044197 Hs.100431 
121463 AK000282 Hs.239681
121517 AI002966 Hs.235402
121553 AA412488 Hs.48620 
121556 AA412494 Hs.98152 
121581 AA416568 

121709 AI338247 Hs.98314
121723 AA243499 Hs.104800 
121831 AA449644 Hs.193063 
121853 AA425887 Hs.98502 
121873 AV550929 Hs.145696 
121913 AI249368 Hs.98558 
121916 AW117207 Hs.98523
122004 AI810721 Hs.95424 
122063 AW794215 Hs.301226 
122223 AF169797 Hs.27413 
122235 AA436475 Hs.112227
122273 AI298368 Hs.150926 
122383 AA446189 Hs.99051 
122507 BE567620 Hs.99210 
122524 AA449453 Hs.192915 

122636 AW651706 Hs.99519 

122637 AA454149 Hs.99357 
122798 AW366286 Hs.145695 
122861 AA335721 Hs.119394
122873 AA749382 Hs.118797
122946 AI718702 Hs.308026 
122963 AA478446 Hs.69559 
122974 AA447871 Hs.194215 
123016 AW338067 Hs.323231 
123090 AL135185 Hs.48778 
123137 AI073913 Hs.100686 
123255 AA830335 Hs.105273 
123284 AA488988 Hs.293796
123442 AA299652 Hs.111496 
123449 AL049325 Hs.112493
123475 BE439553 Hs.250528 
123494 AW179019 Hs.112110
123503 AW975051 Hs.293156 
123516 AB037860 Hs.173933
123518 AL035414 Hs.21068 
123523 AA608588 



hypothetical protein FLJ23399 4.2 

ESTs, Weakly similar to 138022 hypotheti 7.5 

abl-interactor 12 (SH3-containing protei 3.3
Homo sapiens mRNA; cDNA DKFZp586F1323 (f 4.8

ESTs 3.4 

Homo sapiens clone 24630 mRNA sequence 3.9

ESTs 6.8 

ESTs 3.2 

hypothetical protein FLJ10330 8.5

KIAA1323 protein 3.7 

ESTs 3.6 
Homo sapiens mRNA; cDNA DKFZp586B211 (fr 5.6 

leucine zipper protein FKSG14 5.4 

ESTs Z2 

Homo sapiens cDNA FLJ12727 fis, clone NT 5.3

staufen (Drosophila, RNA-binding protein 3.3

EST, Weakly similar to B34067 hypothetic 3.8

ESTs 3.5 

ESTs 3.1 

ESTs 7.9 

hypothetical brain protein my038 5.1 

KIAA1201 protein 3.7 

ESTs 7 

ESTs 3.9 

ESTs 3.4 

B-cell attracting chemokine 1 (CXCL13; 3.5

hypothetical protein FLJ20275 10.3 

ESTs. Weakly similar to T26525 hypotheti 3.5 

TATA box binding protein (TBP)-associate 4.6

EST 4.2 

gb:zu05c10.s1 Soares_testis_NHT Homo sap 3.2
Homo sapiens mRNA; cDNA DKFZp586L0120 (f 3.4 

hypothetical protein FLJ10134 2.9

Homo sapiens cDNA FLJ14201 fis, clone NT 3.9

hypothetical protein FLJ14303 4.4

splicing factor (CC1.3) 3.6

ESTs; protease inhibitor 15 (PI15) 2.7 

ESTs 3.5 

ESTs 4.9 

KIAA1085 protein 3.2

adaptor protein containing pH domain, PT 1 2.6 

membrane-associated nucleic acid binding 4.1

fucose-1-phosphate guanylyltransferase 3.1

ESTs 3.3 

ESTs 3.2 

ESTs, Weakly similar to ALU1_HUMAN ALU S 3.1

hypothetical protein FLJ14007 3.5

EST 3.2 

splicing factor (CC1.3) 3.2

ESTs 5.6 

ubiquitin-conjugating enzyme E2D 3 (homo 3.6

major histocompatibility complex, class 3.7

KIAA1096 protein 7.2

ESTs, Weakly similar to 138022 hypotheti 4.7 

Homo sapiens cDNA FLJ11946 fis, clone HE 3.3

niban protein 3.8 

ESTs, Weakly similar to JE0350 Anterior 9.9

ESTs 4.1 

ESTs 3.7 

Homo sapiens cDNA FLJ11643 fis, clone HE 6.7
Homo sapiens mRNA; cDNA DKFZp564D036 (fr 3.4

Homo sapiens, clone IMAGE:4098694, mRNA, 9.7

mitochondrial ribosomal protein L42 4.2

ESTs, Weakly similar to 178885 serine/th 3.9 

nuclear factor 1/A 4.3 

hypothetical protein 5.8

gb:ae54e06.s1 Stratagene lung carcinoma 3.1 



123527 AF150208 Hs.108327


damage-specific DNA binding protein 1 (1 


5 


121 


OK 




123570 AA608955 Hs.109653


ESTs 


6.6 


68 


10 


6.1 


123619 AA602964 


gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens


85 


85 


4 
1 




123673 BE550112 Hs.158549


ESTs, Weakly similar to T2D3_HUMAN TRANS 3.9


39 


5 


3.7 


123709 AA706910 Hs.112742


ESTs 


3.9 


DO 


ID 


4.0 


123926 AA425769 Hs.227933 


Alg5, S. cerevisiae, homolog of


3.4 




lA 


1 0 
0.0 


123960 AW082862 Hs.287733 


hypothetical protein FLJ23189


4.5 


45 


O 

z 


1 c 

ox> 


124006 AI147155 Hs.270016 


ESTs 


5.8 


oZi 


90 


If 


124059 BE387335 Hs.283713 


ESTs, Weakly similar to S64054 hypotheti


10.4 


Qon 


DC 
00 




124287 H88296 Hs.5123 


inorganic pyrophosphatase 


3.1 


41 


14 


Z.7 


124292 H11341 Hs.13366 


Homo sapiens cDNA: FLJ23567 fis, clone L


3.2 


32 


1 


1.5 


124308 AA249027 Hs.241507


ribosomal protein 36 


10.5 


105 


1 


9.9 


124315 NM_005402 Hs.288757


v-ral simian leukemia viral oncogene hom


12.8 


141 


4 4 
11 


4*i 1 

12.2 


124461 AF283776 Hs.80285 


Homo sapiens mRNA; cDNA DKFZp566C1723 (f 3.1


31 


4 
1 


1.0 


124483 AI821780 Hs.179864 


ESTs 


3.3 


33 


1 


1.7 


124677 R01073 


gb:ye84c03.s1 Soares fetal liver spleen


4.2 


42 


7 


3 


124777 R41933 Hs.140237 


ESTs, Weakly similar to ALU1_HUMAN ALU S


3.4 


210 


63 


3.3 


124940 AF068846 Hs.103804 


heterogeneous nuclear ribonucleoprotein


6.5 


162 


25 


14.7 


125079 T90298 Hs.271396 


ESTs 


3.1 


31 


6 


2.4 


125091 T91518 


gb:ye20f05.s1 Stratagene lung (937210) H 


3.4 


985 


286 


2.8 


125103 AA570056 Hs.122730 


ESTs, Moderately similar to KIAA1215 pro 


3.6 


224 


63 


4 


125144 AB037742 Hs.24336


KIAA1321 protein 


6.3 


63 


6 


5 


125150 W38240 


Empirically selected from AFFX single pr 


3.6 


38 


11 


2.6 


125156 W93048 Hs.250723 


hypothetical protein MGC2747 


3.1 


31 


1 


2.8 


125226 AA782536 Hs.122647 


N-myristoyltransferase 2


3.2 


37 


12 


3.6 


125279 AW401809 Hs.4779 


KIAA1150 protein 


13.1 


131 


1 


5.1 


125299 T32982 Hs.102720 


ESTs 


7.7 


81 


11 


7.6 


125303 AA173319 Hs.288193 


hypothetical protein MGC12217


14.3 


143 


9 


13.1 


125377 W72949 Hs.77495 


UBX domain-containing 1 


3.3 


34 


11 


3.2 


125390 AL038165 Hs.75187 


translocase of outer mitochondrial membr


8.2 


124 


15 


11.5 


125471 AA421691 Hs.152601 


UDP-glucose ceramide glucosyltransferase


3.7 


224 


61 


21 


125617 AA287921 Hs.164950


ESTs 


6.7 


67 


1 


6 


125621 T62641 Hs.278544 


acetyl-Coenzyme A acetyltransferase 2 (a


5.5 


55 


10 


4.2 


125628 AA418069 Hs.241493


natural killer-tumor recognition sequenc


5.5 


63 


12 


1 


125660 AW292171 Hs.23978 


scaffold attachment factor B 


4.3 


68 


16 


2.8 


125698 AF078847 Hs.191356 


general transcription factor IIH, polype


4.8 


48 


5 


4.1 


125745 AI858032 Hs.75722 


ribophorin II


6.8 


223 


3o 


0 Q 


125770 AA143045 Hs.81665 


v-kit Hardy-Zuckerman 4 feline sarcoma v


8.3 


87 


11 


0.4 


125827 NM_003403Hs.97496 


YY1 transcription factor


11.3 


124 


11 


9.7 


125852 AW630088 Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B1264 (f 30.6


306 


4 


26.5 


126349 T30968 Hs.13531 


hypothetical protein FLJ10971


4.9 


68 


14 


4 A 

1.4 


126384 AW090198 Hs.4779 


KIAA1150 protein 


6.4 


74 


12 


6.6 


126590 W78968 Hs.181307 


H3 histone, family 3A


5 


264 


53 


O A 

3.4 


126645 AA316181 Hs.61635 


six transmembrane epithelial antigen of


3.8 


38 


1 


2.7 


126663 AW518478 Hs.181297 


ESTs 


3.6 


36 


6 


2.9 


126695 AA643322 Hs.172028


a disintegrin and metalloproteinase doma


3.1 


31 


1 


2.5 


126764 AA036755 Hs.102178 


syntaxin 16 


4.4 


76 


18 


1 


126801 AW663887 Hs.7337 


hypothetical protein FLJ10936 


3.8 


38 


1 


3 


126813 AW163483 Hs.48320 


double ring-finger protein, Dorfin 


6.7 


155 


23 


4 A 

1.4 


126838 AL043489 Hs.279609 


mitochondrial carrier homolog 2 


8.8 


110 


13 


10.5 


126855 AA129640 Hs.128065 


ESTs 


3.6 


36 


10 


1.9 


126971 T26989 Hs.283664 


aspartate beta-hydroxylase 


5.5 


7Q 


10 


A A 


127167 AA625690 Hs.190272 


ESTs 


3.1 


33 


11 


Z3 


127251 AA936428 Hs.128638 


ESTs 


3.5 


35 


1 


3.1 


127349 AA412108 Hs.269350


ESTs 


4.8 


106 


22 


1 


127439 D80237 Hs.14368 


SH3 domain binding glutamic acid-rich pr


7.5 


75 


1 


6.5 


127537 AI926047 Hs.162859 


ESTs 


3.8 


38 


7 


3.4 


127542 AA703664 Hs.245474 


ESTs, Moderately similar to ALU5_HUMAN A


3.3 


33 


9 


0.9 


127677 AF175265 Hs.264190 


vacuolar protein sorting 35 (yeast homol 


4.3 


152 


35 


12.5 


127774 AA313639 Hs.119488 


cystein-rich hydrophobic domain 2 


5.4 


73 


14 


6.8 


127999 AW978827 Hs.69851 


nucleolar protein family A, member 1 (H/


5.2 


81 


16 


1.1 


128218 AA186733 Hs.292154


stromal cell protein 


3.9 


220 


57 


2.5 


128305 AI954968 Hs.279009 


matrix Gla protein 


9.4 


94 


3 


5.3 


128470 AL049974 Hs.100261 


Homo sapiens mRNA; cDNA DKFZp564B222 (fr 4.6


46 


8 


3.9 


128482 AI694143 Hs.295251 


programmed cell death 4


7.2 


72 


1 


5.8 


128501 AL133572 Hs.199009 


protein containing CXXC domain 2 


3.8 


38 


1 


0.9 


128517 AW994403 Hs.100861 


hypothetical protein FLJ14600


5.6 


73 


13 


6.1 
128530 AI932995 Hs.183475

128579 N25956 Hs.101810 

128595 U31875 Hs.272499 

128610 N48373 Hs.10247 

128653 D87432 Hs.10315 

128742 AA307211 Hs.251531

128773 NM_004131 Hs.1051

128790 AF026692 Hs.105700

128793 AB011125 Hs.105749 

128794 NM_014720Hs.105751 
128835 AK001731 Hs.106390
128906 R57988 Hs.10706
128925 R67419 Hs.21851
128949 AA009647 Hs.8850 
129017 AA115333 Hs.107968
129075 BE250162 Hs.83765 
129095 L12350 Hs.108623
129151 N23018 Hs.171391
129168 AI132988 Hs.109052 
129229 AF013758 Hs.109643 
129243 BE169531 Hs.109727 
129259 AF220050 Hs.181385 
129278 NM_015344 Hs.11000
129337 NM_014918 Hs.110488
129351 AL049538 Hs.62349 
129366 BE220806 Hs.184697 
129393 BE219987 Hs.166982
129457 X61959 Hs.207776 
129486 NM_005754 Hs.220689
129586 AW964541 Hs.11500 
129598 N30436 Hs.11556 
129691 M26939 Hs.119571 
129698 BE242144 Hs.12013 
129721 NM_001415 Hs.211539
129740 BE165866 Hs.83623 
129755 R42216 Hs.12342 
129801 R39246 Hs.239666 
129821 AB028945 Hs.12696 
129869 AI222069 Hs.13015 
129955 T71333 Hs.13854 
129977 NM_000399Hs.1395 
130036 BE061916 Hs.125849 
130057 AF027153 Hs.324787 
130095 AK001635 Hs.14838 
130115 T47294 Hs.149923
130170 AW977534 Hs.151469 
130173 U38847 Hs.151518 
130343 AB040914 Hs.278628 

130356 AF127577 Hs.155017 

130357 AL135301 Hs.8768 
130385 AW067800 Hs.155223
130407 BE385099 Hs.334727 
130417 AW163518 Hs.155485 
130441 U63630 Hs.155637 
130455 D90041 Hs.155956 
130466 W19744 Hs.180059 
130526 AW876523 Hs.15929 
130567 AA383092 Hs.1608 
130604 AA383256 Hs.1657 
130614 AI354355 Hs.16697 
130617 M90516 Hs.1674 
130619 AI963376 Hs.12532 
130625 AF176012 Hs.260720
130677 AL161961 Hs.17767 
130681 R62676 Hs.17820 
130693 R68537 Hs.17962 
130712 AJ271881 Hs^79762 



Homo sapiens clone 25061 mRNA sequence 4.2

Homo sapiens cDNA FLJ14232 fis, clone NT 3.1

short-chain alcohol dehydrogenase family 3.3

activated leucocyte cell adhesion molecu 7.3

solute carrier family 7 (cationic amino 3.1

proteasome (prosome, macropain) subunit, 3.6

granzyme B (granzyme 2, cytotoxic T-lymp 3.9

secreted frizzled-related protein 4 17.4 

KIAA0553 protein 3.1 

Ste20-related serine/threonine kinase 3.6 
Homo sapiens mRNA; cDNA DKFZp586H0924 (f 33 

epithelial protein lost in neoplasm beta 11.3

Homo sapiens cDNA FLJ12900 fis, clone NT 7.1

a disintegrin and metalloproteinase doma 4.6

ESTs 8.2 

dihydrofolate reductase 5 

thrombospondin 2 3.2 

C-terminal binding protein 2 4.4

chromosome 14 open reading frame 2 14.2 

polyadenylate binding protein-interactin 7.1

TAK1-binding protein 2; KIAA0733 protein 5

uncharacterized hematopoietic stem/proge 5.2

leptin receptor overlapping transcript-l 3.7

KIAA0990 protein 9.5 

ras association (RalGDS/AF-6) domain con 7.6

Homo sapiens clone 23785 mRNA sequence 7.1

phosphatidylinositol glycan, class F 3.9 

aspartylglucosaminidase 3.6 

Ras-GTPase-activating protein SH3-domain 4

hypothetical protein FLJ21127 4.6

Homo sapiens cDNA FLJ12566 fis, clone NT 4.2

collagen, type III, alpha 1 (Ehlers-Danl 6.4

ATP-binding cassette, sub-family E (OABP 4.8

eukaryotic translation initiation factor 5.6 

nuclear receptor subfamily 1, group I, m 4.5

Homo sapiens clone 24538 mRNA sequence 5.3

Homo sapiens cDNA FLJ13495 fis, clone PL 3.1

cortactin SH3 domain-binding protein 1 1 .4 

hypothetical protein similar to mouse Dn 4.7 

ESTs 3.1 

early growth response 2 (Krox-20 (Drosop 3.2 

chromosome 8 open reading frame 2 67 

solute carrier family 5 (inositol transp 1 

hypothetical protein FLJ10773 14.6 

X-box binding protein 1 3.1 

calcium/calmodulin-dependent serine prot 5.3

TAR (HIV) RNA-binding protein 1 4.2 

KIAA1481 protein 13.2 

nuclear receptor interacting protein 1 3.3 

hypothetical protein FLJ10849 8.1 

stanniocalcin 2 72.2

hypothetical protein MGC3017 6.5 

huntingtin interacting protein 2 3.5

protein kinase, DNA-activated, catalytic 6.1

N-acetyltransferase 1 (arylamine N-acety 10.8 

Homo sapiens cDNA FLJ20653 fis, clone KA 3.9

hypothetical protein FLJ12910 3.9

replication protein A3 (14kD) 4.4 

estrogen receptor 1 32.2 

down-regulator of transcription 1, TBP-b 5.2

glutamine-fructose-6-phosphate transamin 10

chromosome 1 open reading frame 21 3.9 

J domain containing protein 1 10.5 

KIAA1554 protein 6.8 

Rho-associated, coiled-coil containing p 4.1

ESTs 9.2 

bromodomain containing 7 17.5
130723 BE247676 Hs.18442


E-1 enzyme 


8.1 


81 


3 


2.8 


130751 AF052105 Hs.18879


chromosome 12 open reading frame 


45 


49 


1 


4.3 


130780 AA197226 Hs.19347 


hypothetical protein MGC11321 


3.6 


100 


28 


6.6 


130863 Y10805 Hs.20521 


HMT1 (hnRNP methyltransferase, S. cerevi


3.4 


525 


154 


5.3 


130871 AF080158 Hs.226573 


inhibitor of kappa light polypeptide gen 


10.5 


121 


12 


1.6 


130886 AL044315 Hs.173094


Homo sapiens mRNA for KIAA1750 protein,


6 


202 


34 


3.7 


130974 NM_003528 Hs.2178


H2B histone family, member Q


7.1 


100 


14 


75 


130979 NM_012446 Hs.169833


single-stranded-DNA-binding protein


3.2 


87 


27 


1.7 


130987 BE613269 Hs.21893 


hypothetical protein DKFZp761N0624 


3.5 


124 


35 


6.5 


130993 T97401 Hs.21929 


ESTs 


4.5 


45 


1 


2.5 


131076 AA749230 Hs.26433 


dolichyl-phosphate (UDP-N-acetylglucosam


3.2 


210 


66 


3.8 


131085 BE207357 Hs.3454 


KIAA1821 protein 


3.8 


42 


11 


0.6 


131126 NM_016156 Hs.181326


KIAA1073 protein


6.7 


67 


6 


1.9 


131129 BE541042 Hs.23240 


Homo sapiens cDNA: FLJ21848 fis, clone H


5.8 


115 


20 


2.5 


131148 AW953575 Hs^031» 


p53-induced protein PIGPC1


3.8 


585 


153 


3.7 


131164 AW013807 Hs.182265 


keratin 19 


5.2 


1320 


256 


3.2 


131176 AA465113 Hs.23853 


ESTs, Weakly similar to A34615 profilagg 


3.8 


38 


1 


3.3 


131200 BE540516 Hs.293732 


hypothetical protein MGC3195 


4.8 


48 


1 


4.1 


131216 AI815486 Hs.243901 


Homo sapiens cDNA FLJ20738 fis, clone HE


6.1 


343 


56 


16.4 


131245 AL080080 Hs.24766 


thioredoxin domain-containing 


8 


100 


13 


2.9 


131248 AI038989 Hs.332633 


Bardet-Biedl syndrome 2 


4 


95 


24 


1.1 


131273 AW206008 Hs.283378 


Homo sapiens cDNA: FLJ21778 fis, clone H


4.6 


239 


53 


3.5 


131319 NM_003155Hs.25590 


stanniocalcin 1


3.5 


402 


114 


2.1 


131367 AI750575 Hs.173933 


nuclear factor I/A 


3.3 


775 


233 


2.4 


131375 AW293155 Hs.143134 


ESTs 


3.8 


38 


1 


3 


131379 AK001123 Hs.26176 


hypothetical protein FLJ10261


3.9 


116 


30 


0.5 


131388 NM_014810Hs.92200 


KIAA0480 gene product 


7.6 


76 


1 


5 


131475 AA9SK2841 Hsi7263 


KIAA1458 protein 


5.1 


113 


22 


6.1 


131492 AI452601 Hs.288869 


nuclear receptor subfamily 2, group F, m


8.4 


169 


20 


4.6 


131501 AV661958 Hs.8207


GK001 protein 


3.1 


197 


63 


18.7 


131535 N22120 Hs.75277 


hypothetical protein FLJ13910


5.9 


59 


1 


4.4 


131544 AL355715 Hs.28555 


programmed cell death 9 (PDCD9) 


5.1 


51 


1 


3.9 


131546 AA093668 Hs.28578 


muscleblind (Drosophila)-like 


3.8 


79 


21 


6.9 


131562 NM_003512 Hs.28777


H2A histone family, member L


4 


350 


88 


3 


131564 T93500 Hs.28792 


Homo sapiens cDNA FLJ11041 fis, clone PL


4.7 


381 


81 


6.4 


131604 AA305477 Hs.29379 


hypothetical protein FLJ10687


4.6 


46 


7 


3.8 


131684 NM_002104 Hs.3066


granzyme K (serine protease, granzyme 3; 


3.2 


82 


26 


6.6 


131687 BE297535 Hs.3069 


heat shock 70kD protein 9B (mortalin-2)


6.7 


93 


14 


8.4 


131689 AB012124 Hs.30696 


transcription factor-like 5 (basic helix


3.8 


51 


14 


1.7 


131693 AW963776 Hs.110796 


SAR1 protein


7.2 


72 


4 


5.7 


131739 AF017986 Hs.31386


secreted frizzled-related protein 2 


2.1 


1561 


757 


1.7 


131742 AA961420 Hs.31433 


ESTs 


11.7 


117 


1 


10.1 


131775 AB014548 Hs.31921 


KIAA0646 protein 


4.8 


48 


1 


4.6 


131787 D87077 Hs.196275 


KIAA0240 protein 


3.2 


207 


64 


5.5 


131796 X86098 Hs.301449 


adenovirus 5 E1A binding protein 


3.4 


115 


34 


9.1 


131836 W00712 Hs.32990 


DKFZP566F084 protein 


5.8 


91 


16 


1.4 


131853 AI681917 Hs.3321


ESTs, Highly similar to IRX1_HUMAN IROQU


4.9 


632 


129 


1.7 


131877 J04088 Hs.156346 


topoisomerase (DNA) II alpha (170kD) 


6.8 


68 


1 


5.6 


131881 AW361018 Hs.3383 


upstream regulatory element binding prot


4 


140 


35 


1.8 


131885 BE502341 Hs.3402 


ESTs 


5.7 


57 


1 


4.5 


131904 AF078866 Hs.284296 


Homo sapiens cDNA: FLJ22993 fis, clone K


5.5 


90 


17 


2.9 


131919 T15803 Hs.272458 


protein phosphatase 3 (formerly 2B), cat


5.0 


95 


17 


y.i 


131941 BE252983 Hs.35086


ubiquitin specific protease 1


7.4 


103 


14 


6.5 


131945 NM_002916 Hs.35120


replication factor C (activator 1) 4 (37


3.7 


37 


1 


3.4 


131949 AK000010 Hs.258798 


hypothetical protein FLJ20003


3.5 


35 


1 


2.5 


131965 W79283 Hs.35962 


ESTs 


5.5 


168 


31 


4.4


131977 U90441 Hs.3622 


procollagen-proline, 2-oxoglutarate 4-di


3.7 


37 


9 


2.8 


131985 AA503020 Hs.36583


hypothetical protein FLJ22418 


40.2 


402 


1 


4 


131993 AI878910 Hs.3688 


cisplatin resistance-associated overexpr 


7.3 


73 


1 


1.2 


132064 AA121098 Hs.3838 


serum-inducible kinase


22.6 


226 


10 


0.9 


132094 NM_016045 Hs.3945


CGI-107 protein 


3.1 


227 


73 


16.8 


132109 AW190902 Hs.40098 


cysteine knot superfamily 1, BMP antagon


3.5 


73 


21 


6.3 


132116 AW950474 Hs.40289 


ESTs 


3.6 


141 


39 


12.6 


132143 D52059 Hs.7972 


KIAA0871 protein 


4.9 


49 


1 


4.1 


132160 W26406 Hs.295923 


seven in absentia (Drosophila) homolog 1 


4.4 


53 


12 


2.1 


132164 AI752235 Hs.41270 


procollagen-lysine, 2-oxoglutarate 5-dio


5 


225 


45 


9.1 


132180 NM_004460 Hs.418


fibroblast activation protein, alpha


10.7


433 


41 


7.2 
132197 AI6gg482 Hs.42151


ESTs 


3.4 


58 


17 


4 


132258 AI078645 Hs.431


murine leukemia viral (bmi-1) oncogene h


4.2 


42 


1 


2.2 


132298 NM_015986 Hs.7120


cytokine receptor-like molecule 9


3.4


34 


2 


3 


132316 U28831 Hs.44566 


KIAA1641 protein 


18.6 


186 


10 


1.5 


132325 N37065 Hs.44856 


hypothetical protein FLJ12116 


5.5 


323 


59 


10.5 


132358 NM_003542Hs.46423 


H4 histone family, member G 


3.3 


979 


298 


2.2 


132384 AA312135 Hs.46967 


HSPC034 protein 


3.6 


36 


1 


3.1 


132388 W32624 Hs.278626 


Arg/Abl-interacting protein ArgBP2


5.9 


186 


32 


3.7 


132393 AL135094 Hs.47334 


hypothetical protein FLJ14495


4.2 


159 


38 


7.1 


132407 BE613126 Hs.47783 


B aggressive lymphoma gene 


4.6 


46 


1 


4.3 


132425 N87549 Hs.125287 


zinc finger protein ZNF140-like protein


3.6 


146 


41 


1.1 


132440 AB020699 Hs.112751 


KIAA0892 protein


3.3 


33 


4 


2.9 


132465 AW169847 Hs.49169 


KIAA1634 protein 


8.3 


145 


18 


37 


132522 AB023164 Hs.5070


KIAA0947 protein 


4.6 


46 


1 


4.4 


132528 T78736 Hs.50758 


SMC4 (structural maintenance of chromoso


9.3 


93 


1 


8.4 


132530 AA306105 Hs.50785 


SEC22, vesicle trafficking protein (S. c


4.9 


49 


1 


4.4 


132543 BE568452 Hs.5101 


protein regulator of cytokinesis 1 


11.8 


201 


17 


19.1 


132572 AI929659 Hs.237825 


signal recognition particle 72kD


3.8 


38 


1 


3 


132592 AW803564 Hs.288850 


Homo sapiens cDNA: FLJ22528 fis, clone H


4.8 


93 


20 


3.1 


132602 AW606927 Hs.5306 


hypothetical protein DKFZp586F1122 simil


6.1 


61 


2 


5.9 


132616 BE262677 Hs.283558 


hypothetical protein PRO1855


3.4 


193 


58 


12.3 


132617 AF037335 Hs.5338 


carbonic anhydrase XII 


14.2 


390 


28 


22.5 


132618 AL050025 Hs.279916 


hypothetical protein FLJ20151 


3.3 


909 


274 


3.2 


132632 AU076916 Hs.5398 


guanine monophosphate synthetase


5 


50 


1 


4.1 


132668 AB018319 Hs.5460 


KIAA0776 protein 


4.2 


171 


41 


12.6 


132742 AA025480 Hs.292812


ESTs, Weakly similar to T33468 hypotheti


6.5 


65 


1 


5.5 


132790 AW242243 Hs.168670 


peroxisomal farnesylated protein


3.7 


37 


1 


2.2 


132811 U25435 Hs.57419


CCCTC-binding factor (zinc finger protei


7 


115 


17 


5.4 


132852 AL120050 Hs.58220


Homo sapiens cDNA: FLJ23005 fis, clone L


3.3 


61 


19 


5.1 


132856 NM_001448 Hs.58367


glypican 4


4.8 


48 


1 


3.6 


132880 BE077155 Hs.177537 


hypothetical protein DKFZp761B1514 


12.6 


126 


8 


9.9 


132902 AI936442 Hs.59838 


hypothetical protein FLJ10808 


11 


187 


17 


10.4 


132906 BE613337 Hs.234896 


geminin 


3.3 


106 


33 


2.6 


132914 AL047045 Hs.60293 


Homo sapiens clone 122482 unknown mRNA 


3.5 


110 


32 


2.1 


132968 AF234532 Hs.61638 


myosin X 


4.1 


62 


15 


4.9 


132977 AA093322 Hs.301404


binding motif protein 3 


22.1 


221 


9 


17.8 


132990 X77343 Hs.334334 


transcription factor AP-2 alpha (activat


12.7 


311 


25 


2.4 


132994 AA112748 Hs.279905


clone HQ0310 PRO0310p1


3 


380 


127 


5.5 


133011 NM_006379 Hs.171921


sema domain, immunoglobulin domain (Ig), 


7.3 


271 


37 


2.3 


133015 AJ002744 Hs.246315 


UDP-N-acetyl-alpha-D-galactosamine:polyp


4.6 


427 


93 


10.4 


133070 U92649 Hs.64311 


a disintegrin and metalloproteinase doma


3.6 


36 


1 


3.1 


133091 AK001628 Hs.64691 


KIAA0483 protein


55 


117 


23 


5 


133192 AA218564 Hs.67052


vacuolar protein sorting 26 (yeast homol


3.1


359 


118 


2.5 


133197 AI275243 Hs.180201


hypothetical protein FLJ20671 


5.1 


56 


12 


5.7 


133199 AF231981 Hs.250175 


homolog of yeast long chain polyunsatura


3 


816 


275 


3.9


133221 W32474 Hs.301746 


RAP2A, member of RAS oncogene family


3.1 


234 


76 


8.6 


133240 AK001489 Hs.242894 


ADP-ribosylation factor-like 1


8.1 


81 


1 


4.6 


133271 Z48633 Hs.283742


H.sapiens mRNA for retrotransposon


12.4 


124 


6 


10.8 


133291 BE297855 Hs.69855 


NRAS-related gene 


3.3 


33 


1 


2.9 


133294 AJ001388 Hs.69997 


zinc finger protein 238 


7.9 


234 


30 


18.9 


133350 AI499220 Hs.71573 


hypothetical protein FLJ10074


4.6 


46 


5 


3.5 


133362 AK001519 Hs.7194 


CGI-74 protein 


5 


110 


22 


9.7 


133370 AF245505 Hs.72157 


DKFZP564I1922 protein 


3.2 


725 


227 


3.2 


133407 AF017987 Hs.7306 


secreted frizzled-related protein 1 


4.1 


374 


91 


1.1 


133422 AB033061 Hs.73287 


KIAA1235 protein 


4.3 


43 


1 


3.9 


133435 AI929357 Hs.323956 


Homo sapiens clone H63 unknown mRNA


5.5 


186 


34 


16.5 


133479 W01556 Hs.238797 


ESTs, Moderately similar to 138022 hypot


3.5 


35 


7 


2.1 


133493 AW998046 Hs.194369 


arginine-glutamic acid dipeptide (RE) re


3.6 


39 


11 


0.4 


133504 NM_004415 Hs.74316


desmoplakin (DPI, DPII)


4.1 


640 


158 


3 


133517 NM_000165 Hs.74471


gap junction protein, alpha 1, 43kD (con 


Z2 


351 


111 


5.2 


133536 W25797 Hs.177486


amyloid beta (A4) precursor protein (pro


3.2 


226 


71 


2.8 


133578 AU077050 Hs.75066 


translin 


3.4 


178 


53 


8.8 


133633 D21262 Hs.75337 


nucleolar and coiled-body phosphoprotein


4.7 


47 


1 


4 


133640 AW246428 Hs.75355 


ubiquitin-conjugating enzyme E2N (homolo


8.5 


85 


1 


7.2 


133669 NM_006925 Hs.166975


splicing factor, arginine/serine-rich 5


3.6 


36 


1 


0.4 


133681 AI352558 Hs.75544


tyrosine 3-monooxygenase/tryptophan 5-mo 


3.4 


234 


68 


10.7 


133746 AW410035 Hs.75862 


MAD (mothers against decapentaplegic, Dr 


9.3 


93 


1 


7.8 
133765


M62194 Hs.75929 


cadherin 11. type 2. OB-cadherin (osteob 


3.2 


560 


174 


2.6 


133780 


AA557660 Hs.76152


decorin


5.4 


144 


27 


13.3 


133784 


BE522743 Hs.301064 


arfaptin 1 


4.7 


47 


1 


4.1 


133814 


NM_002462 Hs.76391


myxovirus (influenza) resistance 1, homo


3.3 


380 


114 


4.9 


133829 


AW630088 Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B1264 (f 6.7


304 


46 


7.8 


133845 


AA147026 Hs.76704 


ESTs 


6.2 


600 


97 


4.1 


133913 


AU076964 Hs.7753 


calumenin


3.3 


889 


267 


5 


133988 


AA355986 Hs.232068 


transcription factor 8 (represses interl


3.7 


91 


25 


2.6 


133990 


R48316 Hs.7822 


Homo sapiens mRNA; cDNA DKFZp564C1216 (f 3.4


91 


27 


8.5 


133999 


AA535244 Hs.78305 


RAB2, member RAS oncogene family 


7.8 


78 


1 


5.6 


134032 


NM_005025 Hs.78589


serine (or cysteine) proteinase inhibito 


5.9 


59 


1 


3.3 


134064 


AF091622 Hs.78893 


KIAA0244 protein 


5.8 


58 


1 


4.9 


134087 


U51166 Hs.173824 


thymine-DNA glycosylase


6.4 


100 


16 


4.4 


134089 


R51273 Hs.79029 


ESTs 


5.1 


51 


9 


3.8 


134095 


NM_004354Hs.79069 


cyclin G2


5 


50 


1 


3.2 


134098 


BE513171 Hs.79086 


mitochondrial ribosomal protein L3


4.8 


246 


51 


3.9 


134110 


U41060 Hs.79136 


LIV-1 protein, estrogen regulated 


4.5 


1472 


330 


2.1 


134125 


NM_014781Hs.50421 


KIAA0203 gene product 


4.6 


69 


15 


5.8 


134246 


D28459 Hs.80612 


ubiquitin-conjugating enzyme E2A (RAD6 h 


7 


97 


14 


7.5 


134257 


C05768 Hs.8078 


Homo sapiens clone FBD3 Cri-du-chat crit


3.4 


34 


5 


2.6 


134272 


X76040 Hs.278614 


protease, serine, 15 


3.6 


36 


1 


2.8 


134282 


R45621 Hs.81057 


hypothetical protein MGC2718 


6.7 


67 


9 


5.7 


134288 


AI022650 Hs.8117 


erbb2-interacting protein ERBIN 


4.5 


137 


31 


12 


134321 


BE538082 Hs.8172 


ESTs, Moderately similar to A46010 X-lin


5.2 


52 


1 


4.9 


134326 


AW903838 Hs.81800 


chondroitin sulfate proteoglycan 2 (vers 


8.6 


568 


66 


22.4 


134328 


AW959281 Hs.8184 


ESTs 


4.8 


53 


11 


3.7 


134348 


AW291946 Hs.82065 


interleukin 6 signal transducer (gp130,


7.1 


71 


4 


6.4 


134359 


NM_001982 Hs.199067


v-erb-b2 avian erythroblastic leukemia v 


3 


68 


23 


2.8 


134367 


AA339449 Hs.82285 


phosphoribosylglycinamide formyltransfer


4.4 


44 


1 


4.1 


134374 


N22687 Hs.8236 


ESTs 


13.3 


445 


34 


6 


134380 


AU077143 Hs.179565 


minichromosome maintenance deficient (S. 


4.5 


45 


2 


3.4


134395 


AA456539 Hs.8262 


lysosomal 


8 


60 


5 


5.9 


134401 


AI916662 Hs.211577 


kinectin 1 (kinesin receptor)


4.1 


301 


73 


6.1 


134405 


AW067903 Hs.82772 


collagen, type XI, alpha 1


4.6 


1216 


267 


4.4 


134415 


AI750762 Hs.82911 


protein tyrosine phosphatase type IVA, m 


4.9 


163 


34 


15.1 


134417 


NM_006416 Hs.82921


solute carrier family 35 (CMP-sialic aci


4.9 


49 


3 


3.8 


134419 


W95642 Hs.82961 


trefoil factor 3 (intestinal)


3.2 


1872 


592 


3.3 


134421 


AU077196 Hs.82985 


collagen, type V, alpha 2 


6.3 


1075 


171 


3.8 


134436 


U29344 Hs.83190 


fatty acid synthase


3.3 


710 


217 


2 


134485 


X82153 Hs.83942 


cathepsin K (pycnodysostosis)


34.3 


411 


12 


5.1 


134487 


AF061739 Hs.83954 


protein associated with PRK1 


4.8 


153 


32 


4.3 


134495 


D63477 Hs.84087 


KIAA0143 protein 


3.1 


147 


48 


12.7 


134520 


BE091005 HSJ4861 


activated RNA polymerase II transcriptio


3.3 


33 


1 


2 


134542 


M14156 Hs.85112 


insulin-like growth factor 1 (somatomedi


4.2 


42 


5 


2.6 


134570 


U66615 Hs.172280 


SWI/SNF related, matrix associated, acti


3.9 


39 


1 


2.5 


134590 


AW903849 Hs.173840 


HUEL (C4orf1)-interacting protein


3.7 


41 


11 


0.6 


134604 


NM_002884 Hs.865


RAP1A, member of RAS oncogene family 


5.2 


52 


1 


3 


134612 


AW068223 Hs.171581 


ubiquitin C-terminal hydrolase UCH37


4.9 


49 


1 


3.7 


134643 


AW299723 Hs.87223 


bone morphogenetic protein receptor, typ


5.2 


52 


5 


3.5 


134654 


AK001741 Hs.8739


hypothetical protein FLJ10879 


6.4 


64 


1 


5.1


134656 


AI750878 Hs.87409 


thrombospondin 1 


12.6 


126 


1 


10.8 


134672 


AF271212 Hs.322901 


disrupter of silencing 10


5.4 


81 


15 


2.6 


134700 


AK000606 Hs.8868


golgi SNAP receptor complex member 1


3.4 


179 


52 


1.5 


134711 


X04011 Hs.88974 


cytochrome b-245, beta polypeptide (chro 


3.2 


143 


45 


13.9 


134722 


AF129536 Hs.284226 


F-box only protein 6 


7 


70 


6 


6 


134856 


BE281128 Hs.9030 


TONDU 


3.1 


31 


1 


2.3 


134880 


AI879195 Hs.90606 


15 kDa selenoprotein


5.7 


57 


1 


5 


134917 


X87241 Hs.166994 


FAT tumor suppressor (Drosophila) homolo


3.2 


153 


48 


4.7 


134921 


AL137491 Hs.125511 


Homo sapiens mRNA; cDNA DKFZp434P1530 (f 4 


452 


114 


2 


134982 


AK002085 Hs.92308 


Homo sapiens cDNA FLJ11223 fis, clone PL


5.1 


150 


30 


7.2 


134989 


AW968058 Hs.92381 


nudix (nucleoside diphosphate linked moi 


8.2 


114 


14 


9.9 


135029 


H58818 Hs.187579 


hydroxysteroid (17-beta) dehydrogenase 7


11.5 


115 


1 


10 


135035 


AIJ034344 HS.2W186 


forkhead box C1


5.4 


259 


48 


1.4 


135051 


AI272141 Hs.83484


SRY (sex determining region Y)-box 4


3.3 


1296 


394 


2.2 


135062 


AK000967 Hs.93872 


KIAA1682 protein


3.8 


240 


64 


3.2 


135073 


W55956 Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (f 8.1


101 


13 


7.9 


135098 


AW274526 Hs.277721 


ovarian carcinoma antigen CA125


3.3 


33 


1 


2.6 
135117 W52493


Hs.94694 


Homo sapiens cDNA FLJ10561 fis, clone NT


5.3 


53 


1 


4.1 




135144 NM_016255 Hs.95260


Autosomal Highly Conserved Protein 


7.4 


74 


5 


2.4 




135154 AK001835 


Hs.267812


sorting nexin 4


6.6 


69 


11 


6.3 




135155 AI207958


Hs.166556 


Homo sapiens, Similar to TEA domain fami 


6.1 


61 


1 


5.1 




135172 AB028956 


Hs.12144 


KIAA1033 protein


3.4 


88 


26 


1.4 




135242 AI583187 


Hs.9700 


cyclin E1


3.1 


31 


-1 


2.3 




135243 BE463721 


Hs.97101 


putative G protein-coupled receptor


3.4 


169 


50 


9.1 




135269 NM_003403 Hs.97496


YY1 transcription factor 


3.4 


475 


142 


2.5 




135356 BE312948 


Hs.18104 


hypothetical protein FLJ11274


3.1 


31 


10 


1.7 




135357 AI565004 


Hs.79572 


cathepsin D (lysosomal aspartyl protease


4.7 


710 


151 


15 




135389 U05237 


Hs59872 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 




135397 L14922 


Hs.166563 


replication factor C (activator 1) 1 (14


3.2 


32 


1 


ZA 




135400 X78592


Hs.99915 


androgen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 




AI471525 


Hs.2474a6 


ESTs 


3.8 


58 


16 


5.5 




X70683 


Hs.93668 


ESTs 


1.8 


1047 


596 


1.6 




L14922 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein


5 


285 


58 


1.2 




M23263 


Hs504 


amylo-1;6-glucosidase; 4-alpha-glucanotransferas


3.1 


31 


1 


2.6 


AI267886 


Hs.148027 


polymerase (RNA) II (DNA directed) polypeptide B 


7.8 


137 


18 


11.9 


AA044840 


Hs.241676 


stromal cell-derived factor 1


4.7 


114 


25 


0.9 




N90960 


Hs^27459 


ESTs; Moderately similar to !l!t ALU SUBFAMILY 


4.7 


151 


32 


9.3 


AA873285 


Hs.137947 


ESTs 


4.7 


47 


3 


4.4 




T56679 


Hs.865 


RAP1 A; member of RAS oncogene family 


4 


40 


1 


3.4 




AA305535 




EST176522 Colon carcinoma (Caco-2) cell line II


3.6 


121 


34 


11.8 


AI369384 




arylsulfatase D 


3.5 


113 


33 


1.7 




AA219081 


Hs.242396 


ESTs; Moderately similar to !lll ALU SUBFAMILY) 


3.4 


107 


32 


9.9


TABLE 10A



Table 10A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 10.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers




Pkey    CAT number               Accession
123619  371681_1                 AA602964 AA609200
104602  524482_1                 H47610 R86920
121581  283769_1                 AA416568 AA442889 AA417233 AA442223
123523  genbank_AA606588         AA606588
100821  tigr_HT4306              M26460 U09116
125091  genbank_T91518           T91518
125150  NOT_FOUND_entrez_W38240  W38240
118475  genbank_N56845           N66845
104787  genbank_AA027317         AA027317
106055  genbank_AA417034         AA417034
113702  genbank_T97307           T97307
101046  entrez_K01160            K01160
101447  entrez_M21305            M21305
101624  entrez_M55998            M55998
124677  genbank_R01073           R01073
110581  genbank_H61560           H61560
119023  genbank_N98488           N98488
110775  genbank_N22414           N22414
112092  genbank_R44538           R44538
112253  genbank_R51818           R51818
107014  genbank_AA598820         AA598820
114988  genbank_AA251089         AA251089
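
The Pkey / CAT number / Accession layout of Table 10A can be read into a simple lookup structure. The following is a minimal Python sketch, not part of the original disclosure: the names `ClusterEntry` and `parse_cluster_rows` are hypothetical, the sample rows are transcribed from Table 10A, and the underscore inside the CAT numbers is an assumed reading where the scan is unclear.

```python
from typing import Dict, List, NamedTuple


class ClusterEntry(NamedTuple):
    """One Table 10A row without its key (hypothetical helper type)."""
    cat_number: str        # gene cluster identifier ("CAT number" column)
    accessions: List[str]  # Genbank accessions that make up the cluster


def parse_cluster_rows(lines: List[str]) -> Dict[int, ClusterEntry]:
    """Map each Pkey to its CAT number and accession list.

    Assumes whitespace-separated rows of the form
    "<Pkey> <CAT number> <accession> [<accession> ...]".
    """
    table: Dict[int, ClusterEntry] = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or incomplete rows
        pkey, cat_number, *accessions = fields
        table[int(pkey)] = ClusterEntry(cat_number, accessions)
    return table


# Sample rows transcribed from Table 10A (assumed readings of the scan).
ROWS = [
    "125091 genbank_T91518 T91518",
    "112092 genbank_R44538 R44538",
    "104602 524482_1 H47610 R86920",
]

clusters = parse_cluster_rows(ROWS)
print(clusters[104602].accessions)
```

A dictionary keyed by Pkey mirrors how the table is used: a probeset identifier from Table 10 that has no UnigeneID is looked up here to recover its cluster and member accessions.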
TABLE 11: Figure 11 from BRCA 001-3 PCT



Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal
breast tissue.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to normal body
R3: Ratio of 75th percentile normal body to tumor
R4: Ratio of tumor to normal breast tissue






Pkey    ExAccn     UnigeneID  Unigene Title  R1  R2  R3  R4
100131  D12485     Hs.11951   ectonucleotide pyrophosphatase/phosphodi  13.2  244  19  9.9
100147  D13666     Hs.136348  osteoblast specific factor 2 (fasciclin  15.7  1030  66  5
100522  X51501     Hs.99949   prolactin-induced protein  22.7  760  34  1.4
100666  L05424     Hs.169610  CD44 antigen (homing function and Indian  8.5  85  1  3.2
101104  AW662258   Hs.169266  neuropeptide Y receptor Y1  15.3  153  1  14.1
101478  NM_002890  Hs.758     RAS p21 protein activator (GTPase activa  9.6  96  1  8.5
101724  L11690     Hs.620     bullous pemphigoid antigen 1 (230/240kD)  9.4  94  1  0.3
101754  S70114     Hs.239489  TIA1 cytotoxic granule-associated RNA-bi  8.9  89  5  8
101888  AL049610   Hs.95243   transcription elongation factor A (SII)-  7.3  73  1  5.3
102165  BE313280   Hs.159627  death associated protein 3  9.3  93  5  6
102304  AF015224   Hs.46452   mammaglobin 1  6.5  2058  243  1.4
102348  U37519     Hs.87539   aldehyde dehydrogenase 3 family, member  6.4  428  67  2.3
102457  NM_001394  Hs.2359    dual specificity phosphatase 4  20.2  202  5  1.3
102567  U63630     Hs.146847  TRAF family member-associated NFKB activ  8.2  82  1  6.8
102823  D85390     Hs.5057    carboxypeptidase D  5.6  56  1  5.3
103557  AL133415   Hs.297753  vimentin  7.5  136  18  3.4
103613  NM_000346  Hs.2316    SRY (sex determining region Y)-box 9 (ca  7.3  73  1  5.2
104115  AF183810   Hs.26102   opposite strand to trichorhinophalangeal  29  290  1  26.8
104667  AI239923   Hs.30098   ESTs  14.9  149  1  6.4
104804  AI858702   Hs.31803   ESTs, Weakly similar to N-WASP [H.sapien  7.7  77  1  5.1
104807  AI139058   Hs.125790  leucine-rich repeat-containing 2  7  70  1  6.5
104896  AW015318   Hs.23165   ESTs  7.4  74  1  6
104943  AF072873   Hs.114218  frizzled (Drosophila) homolog 6  16.2  162  1  4.2
105038  AW503733   Hs.9414    KIAA1488 protein  5.5  55  1  5.2
105329  AA234561   Hs.22862   ESTs  2.8  131  47  3.9
105500  AW602166   Hs.222399  CEGP1 protein  25.4  508  20  3
105516  AK001269   Hs.30738   hypothetical protein FLJ10407  8.3  83  3  1.8
105730  AW377314   Hs.5364    DKFZP554 1052 protein  6.9  69  1  4.4
106012  AI240665   Hs.8895    ESTs  21.2  212  6  17.4
106095  AF115402   Hs.11713   E74-like factor 5 (ets domain transcript  25.3  356  14  1
106155  AA425414   Hs.33287   nuclear factor I/B  9.9  483  49  1.8
107102  AB037765   Hs.30652   KIAA1344 protein  6.3  63  1  5.4
107136  AV661958   Hs.8207    GK001 protein  2.5  392  155  4.3
107151  AW378065   Hs.8687    ESTs  15.6  156  7  10.8
107922  BE153655   Hs.61460   Ig superfamily receptor LNIR  9  90  1  5.5
106339  AW151340   Hs.51615   ESTs, Weakly similar to ALU7_HUMAN ALU S  18.7  187  1  17
109112  AW419196   Hs.257924  hypothetical protein FLJ13782  4.1  334  82  3.4
109292  AW975746   Hs.188662  KIAA1702 protein  7.1  71  1  6.5
109415  U80736     Hs.110826  trinucleotide repeat containing 9  12.3  123  1  11.3
109912  AW390822   Hs.301528  L-kynurenine/alpha-aminoadipate aminotra  14.2  142  1  9.5
110009  BE075297   Hs.6614    ESTs, Weakly similar to A43932 mucin 2 p  6.3  693  110  7.2
110915  BE092285   Hs.29724   hypothetical protein FLJ13187  20.9  209  1  19.5
111164  N46180     Hs.122469  Homo sapiens cDNA FLJ13289 fis, clone OV  7.7  77  1  5
111179  AK000136   Hs.10760   asporin (LRR class 1)  25.1  288  12  6.7


111190  AK002055   Hs.151046  hypothetical protein FLJ11193  6.3  63  1  5.8
111223  AA852773   Hs.334838  KIAA1666 protein  3.6  402  112  4.9
111357  BE314949   Hs.87128   hypothetical protein FLJ23309  3.8  425  111  4
112244  AB029000   Hs.70823   KIAA1077 protein  5.7  567  100  6.7
113047  AI571940   Hs.7549    ESTs  9.6  124  13  9
113702  T97307                gb:ye53h05.s1 Soares fetal liver spleen  12.3  129  11  11.7
114124  W57554     Hs.125019  lymphoid nuclear protein (LAF4) mRNA  24.2  242  10  5.6
114138  AW384793   Hs.15740   Homo sapiens mRNA; cDNA DKFZp434E033 (fr  6.7  67  1  6.3
114768  AF212848   Hs.182339  ets homologous factor  137  137  1  8.9
114860  AL157545   Hs.42179   bromodomain and PHD finger containing, 3  9.1  91  1  7.6
114965  AI733881   Hs.72472   BMP-RIB  355  359  10  297
114988  AA251089              gb:zs04!055l NCI_CGAP_GCB1 Homo sapiens  11.5  115  1  6.9
115206  AW183695   Hs.186572  ESTs  5.8  58  1  5
115719  AW992405   Hs.59622   Homo sapiens, clone IMAGE:3507281, mRNA,  7.6  144  19  13.9
115644  AI373062   Hs.332938  hypothetical protein MGC5370  6.2  62  1  5.4
116470  AI272141   Hs.83484   SRY (sex determining region Y)-box 4  1.8  1047  596  1.6
116786  H25636     Hs.301527  ESTs, Moderately similar to unknown [H.s  22.8  228  9  12.4
117280  M18217     Hs.172129  Homo sapiens cDNA: FLJ21409 fis, clone C  3.9  322  83  4.4
117412  N32536     Hs.42645   solute carrier family 16 (monocarboxylic  17.4  174  9  6.9
118472  AL157545   Hs.42179   bromodomain and PHD finger containing, 3  14.5  145  1  2.4
119271  AI061118   Hs.65328   Fanconi anemia, complementation group F  8.2  82  1  6.4
119771  AI905687   Hs.2533    EST  3.5  2073  595  2.1
120562  BE244580   Hs.302267  hypothetical protein FLJ10330  8.5  127  15  1.6
121463  AK000282   Hs.239681  hypothetical protein FLJ20275  10.3  103  1  9.3
121723  AA243499   Hs.104800  hypothetical protein FLJ10134  2.9  214  74  3.7
122963  AA478446   Hs.69559   KIAA1096 protein  7.2  72  1  5.7
123137  AI073913   Hs.100686  ESTs, Weakly similar to JE0350 Anterior  9.9  351  36  13.9
123619  AA602964              gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens  8.5  85  1  4.3
123709  AA706910   Hs.112742  ESTs  3.9  60  16  4.6
124006  AI147155   Hs.270016  ESTs  5.8  321  55  17
124059  BE387335   Hs.263713  ESTs, Weakly similar to S64054 hypotheti  10.4  660  85  5.3
124308  AA249027   Hs.241507  ribosomal protein S6  10.5  105  1  9.9
125279  AW401809   Hs.4779    KIAA1150 protein  13.1  131  1  5.1
125617  AA287921   Hs.164950  ESTs  6.7  67  1  6
127439  D60237     Hs.14368   SH3 domain binding glutamic acid-rich pr  30.6  306  4  26.5
128305  AI954968   Hs.279009  matrix Gla protein  7.5  75  1  6.5
128482  AI694143   Hs.295251  programmed cell death 4  7.2  72  1  5.8
128790  AF026692   Hs.105700  secreted frizzled-related protein 4  17.4  409  24  7.8
128925  R57419     Hs.21851   Homo sapiens cDNA FLJ12900 fis, clone NT  7.1  392  56  3.6
129017  AA115333   Hs.107968  ESTs  8.2  82  1  7.4
129229  AF013758   Hs.109643  polyadenylate binding protein-interactin  7.1  71  1  6.2
129337  NM_014918  Hs.110488  KIAA0990 protein  9.5  95  1  8.5
129366  BE220806   Hs.184697  Homo sapiens clone 23785 mRNA sequence  7.1  150  21  14.5
129821  AB028945   Hs.12696   cortactin SH3 domain-binding protein  11.4  114  1  10
130036  BE061916   Hs.125849  chromosome 8 open reading frame 2  6.7  67  1  5.7
130057  AF027153   Hs.324787  solute carrier family 5 (inositol transp  1  1  1  1
130095  AK001635   Hs.14838   hypothetical protein FLJ10773  14.6  219  15  7.6
130343  AB040914   Hs.278628  KIAA1481 protein  13.2  331  25  12.4
130386  AW067800   Hs.155223  stanniocalcin 2  72.2  722  1  1.9
130407  BE385099   Hs.334727  hypothetical protein MGC3017  6.5  65  4  5.3
130441  U63630     Hs.155637  protein kinase, DNA-activated, catalytic  6.1  61  1  5.7
130455  D90041     Hs.155956  N-acetyltransferase 1 (arylamine N-acety  10.8  706  66  9.2
130604  AA383256   Hs.1657    estrogen receptor 1  32.2  322  1  4.7
130517  M90516     Hs.1674    glutamine-fructose-6-phosphate transamin  10  100  1  7.6
130712  AJ271881   Hs.279762  bromodomain-containing 7  17.5  175  2  12.8
131148  AW953575   Hs.303125  p53-induced protein PIGPC1  3.8  585  153  3.7
131388  NM_014610  Hs.92200   KIAA0460 gene product  7.6  76  1  5
131564  T93500     Hs.28792   Homo sapiens cDNA FLJ11041 fis, clone PL  4.7  381  81  6.4
131742  AA961420   Hs.31433   ESTs  11.7  117  1  10.1
131877  J04066     Hs.156346  topoisomerase (DNA) II alpha (170kD)  6.8  68  1  5.6
131985  AA503020   Hs.36563   hypothetical protein FLJ22416  40.2  402  1  4
132316  U26831     Hs.44566   KIAA1641 protein  18.6  186  10  1.5
132528  T78736     Hs.50758   SMC4 (structural maintenance of chromoso  9.3  93  1  8.4
132742  AA025480   Hs.292812  ESTs, Weakly similar to T33468 hypotheti  6.5  65  1  5.6
132990  X77343     Hs.334334  transcription factor AP-2 alpha (activat  12.7  311  25  2.4
133015  AJ002744   Hs.246315  UDP-N-acetyl-alpha-D-galactosamine:polyp  4.6  427  93  10.4
133139  AF231981   Hs.250175  homolog of yeast long chain polyunsatura  3  816  275  3.9


133240  AK001489   Hs.242894  ADP-ribosylation factor-like 1  8.1  81  1  4.6
133271  Z48633     Hs.283742  H.sapiens mRNA for retrotransposon  12.4  124  6  10.8
133640  AW246428   Hs.75355   ubiquitin-conjugating enzyme E2N (homolo  8.5  85  1  7.2
133746  AW410035   Hs.75862   MAD (mothers against decapentaplegic, Dr  9.3  93  1  7.8
133999  AA535244   Hs.78305   RAB2, member RAS oncogene family  7.8  78  1  5.6
134110  U41060     Hs.79136   LIV-1 protein, estrogen regulated  45  1472  330  2.1
134485  X82153     Hs.83942   cathepsin K (pycnodysostosis)  34.3  411  12  5.1
134654  AK001741   Hs.8739    hypothetical protein FLJ10879  6.4  64  1  5.1
134880  AI879195   Hs.90606   15 kDa selenoprotein  5.7  57  1  5
135029  H58818     Hs.187579  hydroxysteroid (17-beta) dehydrogenase 7  11.5  115  1  10
135389  U05237     Hs.99872   fetal Alzheimer antigen  20.6  206  4  19.1
128305  AI954968   Hs.279009  matrix Gla protein  9.4  94  3  53

TABLE 11A

Table 11A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 11.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey    CAT number        Accession
123619  371681_1          AA602964 AA609200
113702  genbank_T97307    T97307
114988  genbank_AA251089  AA251089

TABLE 12: Figure 12 from BRCA 001-3 PCT



Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to body
R3: Ratio of 75th percentile body to tumor
R4: Ratio of tumor to normal breast tissue



Pkey    ExAccn     UnigeneID  Unigene Title                           R1    R2    R3   R4
100131  D12485     Hs.11951   phosphodiesterase I (PC-1)                    244   19   9.9
105500  AW602166   Hs.222399  ESTs                                    25.4  508   20   3
112244  AB029000   Hs.70823   KIAA1077 protein                        5.7   567   100  6.7
114124  W57554     Hs.125019  ESTs                                    24.2  242   10   5.6
119771  AI905687   Hs.2533    ESTs                                    3.5   2073  595  2.1
121723  AA243499   Hs.104800  ESTs                                    2.9   214   74   3.7
128790  AF026692   Hs.105700  secreted frizzled-related protein 4     17.4  409   24   7.8
131148  AW953575   Hs.303125  ESTs                                    3.8   585   153  3.7
131985  AA503020   Hs.36563   ESTs                                    40.2  402   1    4
133199  AF231981   Hs.250175  Homo sapiens clone 23904 mRNA sequence  3     816   275  3.9
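
The row layout defined by the legend (Pkey, ExAccn, UnigeneID, Unigene Title, then the four ratios R1 to R4) can be read programmatically. The following is a minimal Python sketch under stated assumptions, not part of the original disclosure: `ProbeRow`, `parse_probe_row` and `rank_by_breast_ratio` are hypothetical names, every row is assumed to carry all eight fields separated by whitespace, and the sample rows are transcribed from Table 12.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ProbeRow:
    """One table row; field names follow the legend (hypothetical type)."""
    pkey: int
    exaccn: str
    unigene_id: str
    title: str
    r1: float  # tumor vs. normal body tissue
    r2: float  # 90th percentile tumor column
    r3: float  # percentile body column
    r4: float  # tumor vs. normal breast tissue


def parse_probe_row(line: str) -> ProbeRow:
    """Split a whitespace-separated row; the title may contain spaces."""
    tokens = line.split()
    if len(tokens) < 8:
        raise ValueError(f"expected at least 8 fields, got {len(tokens)}")
    pkey, exaccn, unigene_id = tokens[0], tokens[1], tokens[2]
    # The last four tokens are always the ratios, so a title that ends in
    # a digit (e.g. "... protein 4") is still kept intact.
    r1, r2, r3, r4 = (float(t) for t in tokens[-4:])
    title = " ".join(tokens[3:-4])
    return ProbeRow(int(pkey), exaccn, unigene_id, title, r1, r2, r3, r4)


def rank_by_breast_ratio(rows: List[ProbeRow]) -> List[ProbeRow]:
    """Order rows by R4 (tumor to normal breast tissue), highest first."""
    return sorted(rows, key=lambda row: row.r4, reverse=True)


# Sample rows transcribed from Table 12.
SAMPLE = [
    "105500 AW602166 Hs.222399 ESTs 25.4 508 20 3",
    "128790 AF026692 Hs.105700 secreted frizzled-related protein 4 17.4 409 24 7.8",
    "131985 AA503020 Hs.36563 ESTs 40.2 402 1 4",
]

ranked = rank_by_breast_ratio([parse_probe_row(line) for line in SAMPLE])
for row in ranked:
    print(row.pkey, row.title, row.r4)
```

Rows that lack a UnigeneID (those resolved through the "A" tables) would need separate handling, since this sketch relies on the first three tokens always being Pkey, ExAccn and UnigeneID.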
TABLE 13: Table 1 from BRCA 001-5 US
Table 13 depicts a preferred group of genes upregulated in breast cancer cells.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue



Pkey    ExAccn     UnigeneID  Unigene Title  R1
100038  M97935                control  16.7
100030  M97935                control  6.3
100040  M97935                control  8.3
100041                        control  14.8
100082  AB003103   Hs.4295    proteasome (prosome; macropain) 26S sub  7.5
100091  AF000177   Hs.111783  Lsm1 protein  4.9
100100  AF006084   Hs.11538   actin related protein 2/3 complex; subunit  4.7
100103  AF007875   Hs.5085    dolichyl-phosphate mannosyltransferase p  13.4
100114  D00596     Hs.82962   thymidylate synthetase  155
100121  D10495     Hs.155342  protein kinase C; delta  4.6
100123  D10523     Hs.168669  oxoglutarate dehydrogenase (lipoamide)  7.5
100128  D11094     Hs.61153   proteasome (prosome; macropain) 26S sub  4.4
100131  D12485     Hs.11951   phosphodiesterase I/nucleotide pyrophosp  8.7
100137  D13627     Hs.15071   chaperonin containing TCP1; subunit 8 (t  9.5
100144  D13643     Hs.75616   Human mRNA for KIAA0018 gene; comp  6
100147  D13666     Hs.136348  osteoblast specific factor 2 (fasciclin I-like  8.5
100154  D14657     Hs.81892   KIAA0101 gene product  105
100164  D14812     Hs.173714  MORF-related gene X  4.6
100169  D14878     Hs.82043   D123 gene product  7.9
100190  D21090     Hs.178658  RAD23 (S. cerevisiae) homolog B  5.8
100203  D25538     Hs.172199  adenylate cyclase 7  9.9
100209  D26308     Hs.76289   biliverdin reductase B (flavin reductase (N  4.9
100215  D26598     Hs.82793   proteasome (prosome; macropain) subunit  14.2
100216  D26599     Hs.1390    proteasome (prosome; macropain) subunit  11.3
100219  D28137     Hs.118110  bone marrow stromal cell antigen 2  5.7
100227  D28915     Hs.82316   interferon-induced; hepatitis C-associated  5.7
100248  D31888     Hs.78398   KIAA0071 protein  7.4
100287  D43950     Hs.1600    chaperonin containing TCP1; subunit 5 (e  5.6
100294  D49396     Hs.75454   antioxidant protein 1  12.9
100307  D50525     Hs.699     hypothetical protein  8.4
100335  D63391     Hs.6793    platelet-activating factor acetylhydrolase;  6.8
100340  D63487     Hs.82563   KIAA0153 protein  4.4
100355  D78129     Hs.71465   Homo sapiens mRNA for squalene epoxi  12.6
100363  D78514     Hs.78563   ubiquitin-conjugating enzyme E2G 1 (hom  4.6
100368  D79987     Hs.153479  extra spindle poles; S. cerevisiae; homolo  6.5
100372  D79997     Hs.184339  KIAA0175 gene product  8.4
100375  D80004     Hs.75909   KIAA0182 protein  4.5
100379  D82060     Hs.278721  Ke4 gene; mouse; human homolog of  8.1
100387  D83777     Hs.75137   KIAA0193 gene product  10.7
100393  D84145     Hs.39913   novel RGD-containing protein  7.2
100398  D84557     Hs.155462  minichromosome maintenance deficient (m  7.2
100405  D86425     Hs.82733   nidogen 2  5.4
100406  D86479     Hs.118397  AE-binding protein 1  4.3
100409  D86957     Hs.80712   KIAA0202 protein  11.9
100421  D86985     Hs.79276   Human mRNA for KIAA0232 gene; comp  9.7
100446  D87464     Hs.10037   KIAA0274 gene product  6.4
100447  D87465     Hs.74583   KIAA0275 gene product  10
100448  D87469     Hs.57652   EGF-like-domain; multiple 2  6.2


100467


D89052 


Hs.7476 


100468 


D89289 


Hs,1 18722 


100486 


HT1112 


Hs.10842 


100497 


HT1400 


Hs.79137 


100618 


HT2710 


Hs.1 14599 


100661 


HT3018 


Hs.1 32748 


100667 


HT3127 


Hs.169610 


100668 


HT3938 


Hs.169610 


100676 


HT3742 


Hs.287820 


100775 


HT26388 


Hs.89603 


100783 


HT4018 


Hs.191356 


100829 


HT4343 


Hs.278544 


100830 


HT4344 


Hs.4756 


100840 


HT4392 


Hs.183418 


100850 


HT417 


Hs.297939 


100866 


HT4582 


Hs.75113 


100906 


HT5158 


Hs.5398 


100914 


HT511 


Hs,324178 


100916 


HT544 


Hs.73946 


100945 


HT884 


Hs.180686 


100975 


J02923 


Hs.76506 


100988 


J03589 


Hs.76480 


100996 


J03909 


Hs.14623 


100999 


J03934 


Hs.80706 


101011 


J04430 


Hs.1211 


101017 


J04599 


Hs.821 


101031 


J05070 


Hs.151738 


101038 


J05249 


Hs.79411 


101054 


K02405 


HS73931 


101061 


K03515 


Hs.180532 


101091 


L06132 


Hs.1 49155 


101097 


L06797 


Hs.89414 


101104 


L07615 


Hs.169266 


101143 


L12723 


Hs.90093 


101152 


L13800 


Hs,9884 


101183 


L19779 


Hs.795 


101216 


L25876 


Hs.84113 


101233 


L29008 


Hs.878 


101247 


L33801 


Hs78802 


101282 


L38810 


Hs.79387 


101326 


L42572 


HS.7B504 


101332 


L47276 


Hs.156346 


101348 


L77213 


Hs.30954 


101352 


L77701 


Hs.16297 


101378 


M1375S 


Hs.833 


101396 


M15796 


Hs.78996 


101404 


M16342 


Hs.182447 


101439 


M20902 


H$^8571 


101464 


M22538 


Hs.51299 


101469 


M22877 


Hs.1 69248 


101472 


M22960 


Hs.1 18126 


101478 


M23379 


Hs.758 


101484 


^A24594 


Hs.20315 


101539 


M30818 


Hs.926 


101540 


M30938 


Hs.84981 


101544 


M31169 




101552 


M31642 


Hs.82314 


101580 


M34677 


Hs.83363 


101600 


M37583 


Hs.1 19192 


101663 


M60750 


Hs.2178 


101664 


M60752 


Hs.121017 


101667 


M60858 


Hs.79110 


101684 


M63256 


HS75124 


101702 


M64929 


Hs.179574 


101754 


M77142 


Hs.239489 


101758 


M77836 


HS79217 


101767 


M81057 


Hs.180864 



ATPase; H+ transporting; lysosomal (vacu

fucosyltransferase 8 (alpha (1:6) fucosyltr

Ras-Like Protein Tc4

Carboxyl Methyltransferase, Aspartate, A

Collagen, Type Viii, Alpha 1

Ribosomal Protein L39 Homolog

Epican, Alt. Splice 11

Epican, Alt. Splice M

Fibronectin, Alt. Splice 1

Mucin 1, Epithelial, Alt. Splice 9

Basic Transcription Factor, 44 Kda Subun

Cytosolic Acetoacetyl-Coenzyme A Thio

Rad2

Protein Kinase Pitslre, Alpha, Alt. Splice
Cathepsin B
Transcription Factor IIIa
Guanosine 5'-Monophosphate Synthase
Ras Inhibitor Inf

Endothelial Cell Growth Factor 1
Oncogene E6-Ap, Papillomavirus
lymphocyte cytosolic protein 1 (L-plastin
ubiquitin-like 4

interferon; gamma-inducible protein 30
diaphorase (NADH/NADPH) (cytochrom
acid phosphatase 5; tartrate resistant
biglycan

matrix metalloproteinase 9 (gelatinase B;
replication protein A2 (32kD)
Human MHC class II HLA-DQ-beta mRN
glucose phosphate isomerase
voltage-dependent anion channel 1
chemokine (C-X-C motif); receptor 4 (fus
Human neuropeptide Y receptor Y1 (NPY
heat shock 70kD protein 4
Homo sapiens liver expressed protein gen
H2A histone family; member 0
cyclin-dependent kinase inhibitor 3 (CDK
sorbitol dehydrogenase
glycogen synthase kinase 3 beta
proteasome (prosome; macropain) 26S sub
inner membrane protein; mitochondrial (m
Homo sapiens (cell line HL-6) alpha topo
phosphomevalonate kinase
COX17 (yeast) homolog; cytochrome c ox
interferon-stimulated protein; 15 kDa
proliferating cell nuclear antigen
heterogeneous nuclear ribonucleoprotein 0
apolipoprotein C-I

NADH dehydrogenase (ubiquinone) flavo
Human somatic cytochrome c (HCS) gene
protective protein for beta-galactosidase (
RAS p21 protein activator (GTPase activa
interferon-induced protein 56
myxovirus (influenza) resistance 2; homol
X-ray repair complementing defective rep
Human propionyl-CoA carboxylase beta-s
hypoxanthine phosphoribosyltransferase 1
DNA segment on chromosome X (unique)
H2A histone family; member Z
H2B histone family; member A
H2A histone family; member A
nucleolin

cerebellar degeneration-related protein (62
protein phosphatase 2 (formerly 2A); regu
TIA1 cytotoxic granule-associated RNA-bi
pyrroline-5-carboxylate reductase 1
carboxypeptidase B1 (tissue)


101770


M81601 


Hs.78869 


transcription elongation factor A (SIQ; 1 


4.6 


101791 


M83822 


Hs.62354 


cell division cycle 4-like


9.7 


101803 


M86546 


Hs.155691 


pre-B-cell leukemia transcription factor 1


5.5 


101809 


M86849


Hs.323733 


Homo sapiens connexin 26 (GJB2) mRNA


22.5 


101839 


M93036 


Hs.692 


membrane component; chromosomal 4; su 


4 


101851 


M94250 


Hs.82045 


midkine (neurite growth-promoting factor


7.6 


101888 


M99701 


Hs.95243 


transcription elongation factor A (SII)-like


11.4 


101973 


S82597 


Hs.80120 


UDP-N-acetyl-alpha-D-galactosamine:po


4.6 


101991 


U00968 


Hs.166 


Human SREBP-1 mRNA; complete cds


4.1 


102009 


U02680 


Hs.82643 


protein tyrosine kinase 9 


4.4 


102025 


U03911 


Hs.78934 


mutS (E. coli) homolog 2 (colon cancer; n


4 


102047 


U07158 


Hs.83734 


syntaxin 4A (placental) 


6.1 


102051 


U07550 


Hs,1197 


heat shock 10kD protein 1 (chaperonin 10


4.4 


102083 


U10323 


Ha75117 


interleukin enhancer binding factor 2; 45k


10.4 


102095 


U11313 


Hs.75760 


sterol carrier protein 2 


9.5 


102130 


ulsoog 


Hs.1575 


small nuclear ribonucleoprotein D3 polyp


6.6 


102133 


U15173 


Hs.155596 


BCL2/adenovirus E1B 19kD-interacting p


4.3 


102148 


U16954 


Hs.75823 


ALLI'fused gene from chromosome 1q 


6^ 


102179 


U19713 


Hs.76364 


allograft inflammatory factor 1


4.8 


102180 


U19718 


Hs.83551 


microfibrillar-associated protein 2


7.2 


102193 


U20758 


Hs.313 


secreted phosphoprotein 1 (osteopontin; b


7.2 


102198 


U21090 


Hs.74598 


polymerase (DNA directed); delta 2; regu


4.3 


102202 


U21931 


Hs.574 


fructose-bisphosphatase 1


4.5 


102209 


U22970 


Hs,265827 


interferon; alpha-inducible protein (clone


9.9 


102211 


U23070 


Hs.78776 


putative transmembrane protein 


4.9 


102220 


U24389 


Hs.65436 


lysyl oxidase-like 1


8.5 


102224 


U24704 


Hs.148495 


proteasome (prosome; macropaln) 26S sub 


5.4 


102234 


U26312 


Hs.278554 


chromobox homolog 3 (Drosophila HP1 g


7.7 


102250 


U28014 


Hs.74122 


caspase 4; apoptosis-related cysteine prot


5.4 


102260 


U28386 


HS.1S9557 


karyopherin alpha 2 (RAG cohort 1; impo 


6.3 


102261 


U2B488 


H&155935 


complement component 3a receptor 1 


5.7 


102273 


U30888 


Hs.75981 


ubquitin specific protease 14 (tRNA^^uan 


6.1 


102298 


U32849 


Hs.54483 


N-myc(and STAT) interactor 


4.1 


102302 


U33052 


Hs,69171 


protein kinase C-like 2 


4.3 


102305 


U33286 


Hs.90073 


chromosome segregation 1 (yeast homolo 


5.4 


102320 


U34683 


Hs.82327 


glutathione synthetase 


4.1 


102330 


U35451 


Hs.77254 


chromobox homolog 1 (Drosophila HP1 b


4 


102348 


U37519 


Hs.87539 


aldehyde dehydrogenase 8 


9.4 


102361 


U39400 


Hs.75859 


chromosome 1 1 open reading frame 4 


5.2 


102362 


U39412 


Hs.75932 


N-ethylmaleimkie^ensilive factor attachm 


9.3 


102369 


U39840 


Hs.299867 


hepatocyte nuclear factor 3; alpha 


7.7 


102395 


U41767 


Hs.92208 


a disintegrin and metalloproteinase domai


10.4 


102409 


U43286 


Hs. 118725 


selenophosphate synthetase 2 


6.2 


102418 


U43923 


Hs.79058 


suppressor of Ty (S.cerevisiae) 4 homolog 


4.1 


102425 


U44772 


Hs.3873 


palmftoyl-proteh thtoesterase(cen)kMrpo 


4.8 


102457 


U48807 


Hs.2359 


dual specificity phosphatase 4


6.3 


102465 


U49352 


Hs^1548 


2;4-djenoyl CoA reductase 1; mitochondri 


9.4 


102495 


U51240 


Hs.79356 


Lysosomal-associated multispanning mem


6.5 


102534 


U56833 


Hs.198307 


von Hippel-Lindau binding protein 1


8.6 


102546 


U57877 


Hs.3577 


succinate dehydrogenase complex; subuni 


4.3 


102549 


U5B046 


Hs.198899 


eukaryotic translation initiation factor 3: 5 


6.3 


102557 


U58766 


HS.26442B 


tissue specific transplantation antigen P35 


5 


102562 


U59309 


Hs.75653 


fumarate hydratase 


6 


102568 


U59877 


Hs.223025 


RAB31; member RAS oncogene family 


9.1 


102580 


U60608 


Hs.152981 


CDP-diacylglycerol synthase (phosphatid 


7.9 


102581 


U61145 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2


7.6 


102590 


U62136 


Hs.79300 


Homo sapiens enterocyte differentiation a 


7 


102591 


U62325 


Hs.324125 


amyloid beta (A4) precursor protein-blndi 


4 


102592 


U62389 


Hs.11223 


Human putative cytosolic NADP-depende 


5 


102617 


U6S928 


Hs.198767 


Jun activation domain binding protein


6.1 


102618 


U65932 


Hs.81071 


extracellular matrix protein 1


232 


102638 


U67319 


H&9216 


caspase 7; apoptosis-related cysteine prot 


8.9 


102663 


U70322 


Hs.168075 


karyopherin (importin) beta 2


7.1 


102666 


U70660 


Hs.279910 


ATX1 (antioxidant protein 1; yeast) homo 


4.7 


102679 


U72661 


Hs.11342 


ninjurin 1; nerve Injury-induced protBln>1 


4.7 


102687 


U73379 


H&93002 


ubiquitin carrier protein E2-C


7.7 


102704 


U76638 


Hs.54089 


BRCA1 associated RING domain 1 


5.6


102705


U77180 


H5.50002 


small inducible cytokine subfamily A (Cy


11.8 


102721 


U79241 


Hs.1 18666 


Human clone 23759 mRNA; partial cds 


15 


102729 


U79254 


Hs.181311 


asparaginyl-tRNA synthetase 


5 


102739 


U79282 


Hs.155572 


Human clone 23801 mRNA sequence 


6 


102742 


U79293 


Hs.159264 


Human clone 23948 mRNA sequence


13.1 


102761 


U82130 


Hs.118910 


tumor susceptibility gene 101 


7 


102788 


U86602 


HS74407 


nucleolar protein p40 


4.1 


102790 


U87269 


Hs.154196 


E4F transcription factor 1 


7.1 


102801 


U69606 


Hs.38041 


pyridoxal (pyridoxine; vitamin B6) kinase 


4,7 


102808 


U90426 


Hs.179606 


nudear RNA heficase; DECD variant of D 


7.5 


102817 


U90904 


Hs.83724 


Human clone 23773 mRNA sequence


15.2 


102823 


U90914 


Hs.5057 


carboxypeptidase D 


6.6 


102827 


U91327 


Hs,6456 


chaperonin containing TCP1; subunit 2 (b 


6 


102838 


U94592 


Hs.80658 


Human uncoupling protein homolog (UCP


6.1 


102841 


U95006 


HS37616 


Human D9 sptioe variant B mRNA; comp 


4.2 


102844 


U96113 


Hs.324275 


Homo sapiens Nedd^-like ubiquitin-prot 


6.8 


102868 


X02419 


Hs,77274 


plasminogen activator; urokinase


4 


102907 


)C06985 


Hs.202833 


heme oxygenase (decycling) 1


22.7 


102919 


XI 2447 




aldolase A; fructose-bisphosphate


9.9 


102929 


X13238


Hs.74649 


cytochrome c oxidase subunit VIc


5.4 


102973 


X16663 


Hs.14601 


hematopoietic cell-specific Lyn substrate


4.8 


102983 


X17620 


Hs.118638


non-metastatic cells 1; protein (NM23A)


4.6 


102985 


X17644 


Hs.2707


G1 to S phase transition 1 


20.6 


103003 


X52003 


Hs.1406 


trefoil factor 1 (breast cancer; estrogen-ind


10.7 


103018 


X53296 


Hs.81134 


interleukin 1 receptor antagonist


5.8 


103023 


X53793 


Hs.117950


multifunctional polypeptide similar to SA


4 


103036 


X54925 


Hs.83169 


matrix metalloproteinase 1 (interstitial col


7.3 


103060 


X57766 


Hs.155324 


matrix metalloproteinase 11 (stromelysin


17.8 


103073 


X59417 


Hs.74077 


proteasome (prosome; macropain) subunit 


5.6 


103075 


X59543


Hsi934 


ribonucleotide reductase M1 polypeptide


4.2 


103060 


X59798 


Hs.82932 


cyclin D1 (PRAD1; parathyroid adenomat


67 


103094 


X60787 


Hs.296281


interleukin enhancer binding factor 1


5.7 


103105 


X61970 


Hs.76913


proteasome (prosome; macropain) subunit 


5.8 


103121 


X63679 


Hs.4147 


translocating chain-associating membrane 


42 


103149 


X66363 


Hs.171834 


PCTAIRE protein kinase 1 


12 


103180 


X69433 


Hs.5337 


isocitrate dehydrogenase 2 (NADP+); mit


18.9 


103182 


X69819 


Hs.99995 


Intercellular adhesion molecule 3 


10.7 


103188 


X70040 


Hs.2942 


macrophage stimulating 1 receptor (c-met 


4.1 


103191 


X70218 


Hs.2903 


protein phosphatase 4 (formerly X); cataly


10.7 


103193 


X70476 


Hs.75724 


coatomer protein complex; subunit beta 2


8.2 


103194 


X70649 


Hs.78580 


DEAD/H (Asp-Glu-Ala-Asp/His) box pol


13.7 


103195 


X70940 


Hs.2642 


eukaryotic translation elongation factor 1


13.4 


103206 


X72755 


Hs.77367 


monokine induced by gamma Interferon 


15.1 


103207 


X72790 




Human endogenous retrovirus mRNA for 


5.3 


103208 


X72841 


Hs.31314 


retinoblastoma-binding protein 7


12.3 


103216 


X74262 


Hs.16003 


retinoblastoma-binding protein 4


4.1 


103226 


X75042 


Hs.44313 


v-rel avian reticuloendotheliosis viral onco


6.9 


103230 


X75861 


Hs.74637 


testis enhanced gene transcript 


7.9 


103262 


X78565 


Hs^89114 


hexabrachion (tenascin C; cytotactin)


5 


103278 


X79882 


Hs.80680 


lung resistance-related protein


5.7 


103297 


X81788 


Hs.9078 


immature colon carcinoma transcript 1


4.6 


103302 


X82103 


Hs.3059 


coatomer protein complex; subunit beta 


4.5 


103316 


X83301 


Hs.324728 


SMA5 


7.1 


103330 


X85373 


Hs.77496 


small nuclear ribonucleoprotein polypepti


4 


103349 


X89059 




serine/threonine kinase 9 


4.7 


103352 


X89398 


Hs.78853 


uracil-DNA glycosylase 


5.3 


103364 


X90872 


Hs.279929 


SULT1C sulfotransferase


4 


103374 


X91788 


Hs.84974 


chloride channel; nucleotide-sensitive; 1A


4.2 


103380 


X92396 


Hs^4167 


synaptobrevin-like 1


13.6 


103395 


X94754 


Hs.279946


methionine-tRNA synthetase


14.2 


103402 


X95404 


Hs.180370 


cofilin 1 (non-muscle)


4.6 


103410 


X96506 


Hs.295362 


DR1-associated protein 1 (negative cofact


8.3 


103420 


X97065 


Hs.173497 


Sec23 (S. cerevisiae) homolog B 


4.9 


103421 


X97074 


Hs.119591 


adaptor-related protein complex 2; sigma


5 


103427 


X97303 


Hs.250655 


H.sapiens mRNA for Ptg-12 protein


7 


103430 


X97544 


Hs.20716 


translocase of inner mitochondrial membr 


4.5 


103438 


X98263 


Hs.152720


M-phase phosphoprotein 6


4.5 






103464 Y00285 Hs.76473 insulin-like growth factor 2 receptor 4.2

103470 Y00796 Hs.174103 integrin; alpha L (antigen CD11A (p180); 4.5

103494 Y08991 Hs.83050 phosphatidylinositol 3-kinase-associated p 4.1

103505 Y09912 Hs.33102 transcription factor AP-2 beta (activating 4.5

103547 Z14982 Hs.180062 proteasome (prosome; macropain) subunit 4.3

103551 Z15115 Hs.75248 topoisomerase (DNA) II beta (180kD) 4

103565 Z22548 Hs.146354 thioredoxin-dependent peroxide reductase 7.6 

103587 Z29083 Hs.82ia 5T4 oncofetal trophoblast glycoprotein 14.6 

103621 Z47727 Hs.150675 polymerase (RNA) II (DNA directed) pol 6.3

103622 Z48042 Hs.278672 membrane component; chromosome 11; s 4.4

103658 Z74615 Hs.172928 collagen; type I; alpha 1 5.9

103680 Z93784 Homo sapiens DNA sequence from PAC 4.4 

103772 AA092473 Hs^78554 chromobox homolog 3 (Drosophila HP1 g 4.9 

103774 AA092898 Hs.92918 ESTs; Weakly similar to R07G33 [Celeg 6.1 

103821 AA157623 Hs.198793 KIAA0750 gene product 23.3

103835 AA172215 Hs.93748 ESTs; Moderately similar to TRANSCRIP 4

103686 AA236384 Hs.105737 ESTs; Weakly similar to gene 9306 protei 4.9

103890 AA236843 Hs.72085 ESTs; Weakly similar to unknown [S.cere 7.8

103892 AA243523 Hs.239189 ESTs 4.8 

104054 AA393432 Hs.7100 hypothetical protein 5.3

104115 AA428090 Hs.26102 ESTs 28.7 

104135 AA442669 Hs.268371 zv58f6.r1 Soares_total_fetus_Nb2HF8_9w 5.7

104147 AA451992 Hs.283037 ESTs; Highly similar to HSPC039 protein 6.9

104173 AA476564 Hs.76561 ESTs; Weakly similar to finger protein HZ 5.2

104181 AA479521 Hs.283740 ESTs 7.8

104183 AA480838 Hs.114309 ESTs 5.1

104192 AA486946 Hs.21321 Homo sapiens mRNA; cDNA DKFZp564 4.3

104209 AB000221 Hs.16530 small inducible cytokine subfamily A (Cy 12.3

104234 AB002357 Hs.168212 kinesin family member 3B 6.2

104271 C01687 Hs.7381 ATP synthase; H+ transporting; mitochon

104278 C02582 Hs.109253 ESTs; Highly similar to N-terminal acetyl 4.5

104307 D52618 Hs.111680 endosulfine alpha 4.7

104309 D55869 Hs.284123 Homo sapiens mRNA full length insert cD 4.2

104370 Hig378 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 6.4

104446 L44497 Hs.7351 ESTs 4.9

104453 M19169 Hs.123114 cystatin SN 11.6

104476 N33807 Hs.324275 protease; serine; 15 5.6 

104558 R56678 Hs.88959 Human DNA sequence from clone 967N2 6.3

104592 R81003 Hs.325820 serine protease; umbilical endothelium 13.6

104634 AA004274 Hs.19151 ESTs 6.3

104636 AA004415 Hs.106106 ESTs 10.1 

104558 AA007145 Hs.27268 Homo sapiens mRNA; cDNA DKFZp5 54 4.3 

104667 AA007234 Hs.30098 ESTs 16.6 

104675 AA009596 Hs.301553 ESTs; Moderately similar to !!!! ALU SU 4.6

104767 AA025534 Hs.8852 ESTs 4.8

104785 AA027163 Hs.7942 ESTs 8.1

104791 AA029046 Hs.301871 ESTs; Moderately similar to cAMP induc 10.9

104804 AA031357 Hs.31803 ESTs; Weakly similar to N-WASP [H.sap 5.5

104807 AA032147 Hs.23296 ESTs 10.4

104837 AA039469 Hs.21126 ESTs; Weakly similar to KIAA0299 [H.s 4.6

104849 AA040270 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 4.3 

104867 AA045481 Hs.225979 Human gene from PACs 3711/117 and 305B 4.5 

104884 AA053021 Hs.14511 SCO (cytochrome oxidase deficient; yeast 4.7 

104906 AA055809 Hs.26802 ESTs; Weakly similar to phosphoproleffit 8.8 

104919 AA057193 Hs.25252 ESTs 5.5

104921 AA057839 Hs.1508 ESTs 4.2 

104926 AA058845 Hs.33363 DKFZP434N093 protein 7 

104938 AA064627 Hs.318725 ESTs; Highly similar to 061-72 protein [H 7.1 

104943 AA065217 Hs.114218 ESTs 5.7

104957 AA074919 Hs.10026 ESTs; Weakly similar to ORFYJIj063cP 4.7

104961 AA076672 Hs.33905 ESTs 5.5

104968 AA084602 Hs.29669 ESTs 4.3

104975 AA086071 Hs.50758 chromosome-associated polypeptide G 8.3

104977 AA088228 Hs.18272 ESTs 6.2 

104978 AA088458 Hs.19322 ESTs 6.7

104987 AA101723 Hs.11861 ESTs 9.2

105002 AA113266 Hs.182704 ESTs; Moderately similar to alternatively 6.9

105012 AA116036 Hs.9329 chromosome 20 open reading frame 1 10.7

105019 AA121879 Hs.9280 proteasome (prosome; macropain) subunit 5.7

105029 AA126855 Hs.13268 ESTs 4.4

105033 AA127964 Hs.274329 TP53 target gene 1 6.3

105035 AA128486 Hs.8859 ESTs 6.5

105039 AA130349 Hs.36475 ESTs 4 

105062 AA134968 Hs.36529 ESTs 4.3 

105076 AA142858 Hs.37810 ESTs 6.4 

105087 AA147884 Hs.9812 ESTs 9.2

105091 AA148859 Hs.179909 ESTs; Weakly similar to !!!! ALU SUBFA 5.7

105093 AA149051 Hs.32405 ESTs 6.3

105107 AA152302 Hs.25035 DKFZP566G223 protein 6.2

105127 AA158132 Hs.301957 ESTs; Weakly similar to contains similarit 5.7

105132 AA159501 Hs.247280 HBV associated factor 4.2

105143 AA165333 Hs.24808 ESTs 4.7

105154 AA171736 Hs.35947 methyl-CpG binding domain protein 4 9

105162 AA175690 Hs.4084 KIAA1025 protein 9.1

105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 193 

105209 AA205072 Hs.227743 KIAA0980 protein 7.4 

105223 AA211388 Hs.7750 ESTs 5.1

105252 AA227428 Hs.9728 ESTs; Weakly similar to KIAA0512 prote 11.1

105253 AA227448 Hs.5003 KIAA0456 protein 6.4

105261 AA227871 Hs.6361 MEK partner 1 9.1

105263 AA227926 Hs.6682 ESTs 6.7

105274 AA228122 Hs.281866 ATPase; H+ transporting; lysosomal (vacu 5.3

105297 AA233451 Hs.183858 transcriptional intermediary factor 1 8.7

105309 AA233790 Hs.4104 ESTs; Weakly similar to cDNA EST yk38 7.4

105312 AA233854 Hs.23348 S-phase kinase-associated protein 2 (p45) 5.8

105342 AA235286 Hs.157078 ESTs 4.5 

105376 AA236559 Hs.8768 ESTs; Weakly similar to !!!! ALU SUBFA 5.8

105366 AA236950 Hs.8115 ESTs 5.5

105397 AA242868 Hs.7395 ESTs; Weakly similar to housekeeping p 7.7

105399 AA243007 Hs.16420 ESTs; Highly similar to SH3 domain-bind 5.6

105400 AA243052 Hs.65648 RNA binding motif protein 8 5.8

105404 AA243303 Hs.21187 ESTs 9.1

105409 AA243562 Hs.301855 ESTs 4.4

105436 AA252172 Hs.237856 ESTs; Moderately similar to cAMP induc 5.1

105483 AA256874 Hs.23458 ESTs 4.9

105433 AA256268 Hs.10283 ESTs 6

105495 AA256317 Hs.2S7B5 Homo sapiens mRNA; cDNA DKFZp586 5.2

105496 AA256323 Hs.301997 DKFZP434N126 protein 8.7 

105500 AA256485 Hs.222399 CGI-96 protein 9.5 

105507 AA256678 Hs.226318 ESTs; Moderately similar to CCR4-associ 4.1 

105538 AA258860 Hs.32597 ring finger protein (C3H2C3 type) 6 4.1 

105544 AA261954 Hs.24678 ESTs 8

105546 AA262032 Hs.2682d1 ESTs; Weakly similar to 62D9.ap.me1an 6.1 

105549 AA262417 Hs.5415 ESTs 4.5 

105551 AA262477 Hs.25292 ribonuclease HI; large subunit 9.1

105560 AA262783 Hs.306915 ESTs 4.5

105565 AA278302 Hs.18349 ESTs; Weakly similar to partial CDS [C.e 4.2

105566 AA278323 Hs.17481 Homo sapiens clone 24606 mRNA sequen 115

105575 AA278717 Hs.12772 ESTs 5.9 

105584 AA279012 Hs.3454 ESTs; Weakly similar to KIAA0665 prote 4.4 

105596 AA279418 Hs.18490 ESTs 4

105604 AA279787 Hs.15467 ESTs; Moderately similar to putative plio 5.6

105610 AA279991 Hs.99872 ESTs; Weakly similar to trithorax homolo 5.3

105621 AA280865 Hs.6375 Homo sapiens mRNA; cDNA DKFZp564 4.8

105627 AA281245 Hs.23317 ESTs 7.5

105638 AA281599 Hs.247817 Homo sapiens mRNA for histone H2B 5.9

105645 AA282138 Hs.11325 ESTs 6.4

105650 AA282347 Hs.25635 ESTs; Highly similar to HSPC003 [H.sap 11.3

105666 AA283930 Hs.34906 ESTs 4.7 

105674 AA284755 Hs.279789 CDW52 antigen (CAMPATH-1 antigen) 8 

105687 AA286809 Hs.28423 ESTs 7.1 

105700 AA287643 Hs.35254 ESTs; Weakly similar to hypothetical pro 4.9

105705 AA290767 Hs.101282 Homo sapiens mRNA; cDNA DKFZp434 8

105709 AA291268 Hs.26761 DKFZP586L0724 protein 6.8

105731 AA292711 Hs.29131 ESTs 6.4

105753 AA299789 Hs.110857 ESTs 7

105774 AA348014 Hs.23412 ESTs 7.1

105784 AA350771 Hs.17850 ESTs 13.4

105791 AA358038 Hs.14368 SH3-binding domain glutamic acid-rich p 4.3

105807 AA393803 Hs.16869 ESTs; Moderately similar to COLLAGEN 5.3

105808 AA393808 Hs.286131 KIAA0438 gene product 4.1

105812 AA394126 Hs.20814 ESTs; Highly similar to CGI-27 protein [H 14.6

105813 AA394140 Hs.18585 ESTs 4.9

105819 AA397920 Hs.28783 Homo sapiens mRNA; cDNA DKFZp564 4.9

105870 AA399623 Hs.101067 ESTs 4.8

105874 AA400074 Hs.171118 ESTs 4 

105696 M4009g9 Hs.7838 Human ring zinc-finger protein (ZNF127- 4.8

105934 AA404248 Hs.16577 ESTs 5.2

105935 AA404277 Hs.263727 ESTs; Weakly similar to bisphosphateS*- 4

105966 AA406105 Hs.5344 adaptor-related protein complex 1; gamma 8.3

105974 AA406321 Hs.6224 KIAA0895 protein 4.6

105990 AA410336 Hs.29403 ESTs; Weakly similar to PROBABLE AT 4.5

105995 AA410510 Hs.5345 ESTs 4.9

106000 AA410972 Hs.20726 ESTs 5.8

106007 AA411462 Hs.11042 ESTs; Weakly similar to veli 1 (H. sapiens 6.9

106016 AA411819 Hs.8164 KIAA0898 protein 5

106034 AA412473 Hs.14928 ESTs 6.6

106042 AA412700 Hs.169895 ubiquitin-conjugating enzyme E2L 6 4.6

106057 AA417067 Hs.289074 ESTs 4.5

106065 AA417558 Hs.25206 ESTs 12J

106070 AA417761 Hs.5957 Homo sapiens clone 24416 mRNA sequen 5

108103 AA421104 Hs.120g4 ESTs 15.4

106126 AA424006 Hs.22972 ESTs; Moderately similar to H5AR[M.m 6.4

106154 AA425304 Hs.6994 ESTs 5.1

106157 AA425367 Hs.34892 ESTs 11.1 

106166 AA425872 Hs.19561 NADH dehydrogenase (ubiquinone) 1 alp 19.3 

106204 AA428024 Hs.21479 ESTs 4.7 

106210 AA428239 Hs.10338 ESTs 5.7 

106220 AA428582 Hs.32196 ESTs; Moderately similar to metaigWInp 7.7

106236 AA429951 Hs.21104 ESTs 8

106240 AA430074 Hs.18552 ESTs; Weakly similar to Ylr218cp [S.cere 4.4

106263 AA431462 Hs.28329 ESTs 4.9

106288 AA435536 Hs.24336 ESTs 8.8

106293 AA435591 Hs.301444 signal sequence receptor; gamma (transloc 8.7

106310 AA436244 Hs.17240 ESTs 4.5

106317 AA436568 Hs.108124 ESTs 4

106328 AA43670S Hs.28a20 KIAA0766 gene product 4.4

106341 AA441798 Hs.5243 ESTs; Moderately similar to pIL2 hypoth 23.7

106348 AA442253 Hs.10702 ESTs 4.7

106350 AA442753 Hs.194698 cyclin B2 6.1

106371 AA443923 Hs.170310 ESTs 6.8 

106389 AA446949 Hs.6236 ESTs 4.7 

105394 AA447223 Hs.25320 Homo sapiens clone 25142 mRNA sequen 4.4 

106426 AA448282 Hs.16206 ESTs; Weakly similar to F55C12.5 [C.ele 4.5

106459 AA449741 Hs.4029 glioma-amplified sequence-41 4.8

105462 AA449912 Hs.30532 ESTs; Highly similar to CGI-77 protein [H 5.2 

106468 AA450047 Hs.14770 ESTs 6.8 

106479 AA450351 Hs.75251 ESTs 12.4 

106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 4.5

106503 AA452411 Hs.29679 ESTs; Highly similar to mediator [H.sapie 5.1

106507 AA4K584 Hs.267819 protein phosphatase 1; regulatory (inhibito 4.9

106533 AA453786 Hs.145998 ESTs 8.3

106568 AA455970 Hs.28285 patched related protein translocated in ren 7.6

106586 AA456598 Hs.57787 ESTs 8.2

106589 AA456646 Hs.28661 ESTs 4.8

106606 AA457730 Hs.283437 Homo sapiens clone 23851 mRNA sequen 4.4

106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapie 7

106614 AA458934 Hs.256150 ESTs 4.5

106628 AA459657 Hs.12311 Homo sapiens clone 23570 mRNA sequen 6.5

106637 AA459961 Hs.250824 ESTs 5.5 

106644 AA460239 Hs.12680 ESTs 4.4 
106664 AA460969 Hs.7510 mitogen-activated protein kinase kinase ki 6.4

106698 AA463745 Hs.29403 ESTs; Weakly similar to PROBABLE AT 5.3

106719 AA465171 Hs.236844 ESTs 5.6

106726 AA465339 Hs.3886 ESTs 10.1

106747 AA476473 Hs.171957 triple functional domain (PTPRF interacti 10.4

106759 AA477263 Hs.25584 ESTs 4.2

106765 AA477717 Hs.306117 interleukin 13 receptor; alpha 1 6.9

106784 AA478558 Hs.227913 API5-like 1 5.1

106831 AA482014 Hs.29463 centrin; EF-hand protein; 3 (CDC31 yeast 5.1

106836 AA482112 Hs.238707 ESTs 4.8

106840 AA482548 Hs.5534 ESTs 10.3

106856 AA486183 Hs.285123 ESTs; Weakly similar to similar to oxyste 6.2

106865 AA487228 Hs.19479 ESTs 4.5

106878 AA488872 Hs.12314 Homo sapiens mRNA; cDNA DKFZp586 7.9

106868 AA489101 Hs.24734 oxysterol binding protein 6.4

106895 AA489665 Hs.25245 ESTs 4.6 

106909 AA490323 Hs.250747 SUMO-1 activating enzyme subunit 1 4.2 

106919 AA490885 Hs.21766 ESTs 12.3 

106920 AA490899 Hs.296323 ESTs 6.2 
106941 AA496204 Hs.237971 ESTs 4

106942 AA496347 Hs.31314 retinoblastoma-binding protein 7 4.8

108948 AA496788 Hs^1077 KIAA0532 protein 4

106968 AA504631 Hs.26813 ESTs; Weakly similar to hypothetical 43^ 4.4

106973 AA505141 Hs.11923 Human DNA sequence from clone 167A1 5.4

106980 AA521121 Hs.8858 bromodomain adjacent to zinc finger dom 4.1

106981 AA521157 Hs.74101 ESTs 57

106998 AA598461 Hs.195464 insulin-like growth factor binding protein 18.7 

107008 AA598710 Hs.23740 ESTs 6.2 

107028 AA599214 Hs.24143 ESTs 4.1 

107032 AA599472 Hs.247309 succinate-CoA ligase; GDP-forming; beta 5.3

107052 AA600134 Hs.12482 glyceronephosphate O-acyltransferase 4.8

107053 AA600147 Hs.5741 ESTs; Weakly similar to NADH-cytochro 5.8

107056 AA600310 Hs.18720 programmed cell death 8 (apoptosis-induc 4.9

107080 AA609210 Hs.19221 ESTs 8.4

107102 AA609723 Hs.30652 ESTs 8

107109 AA609943 Hs.32793 ESTs 9.5

107129 AA620553 Hs.4756 flap structure-specific endonuclease 1 4.9

107132 AA620598 Hs.9052 ESTs 5.3 

107136 AA620795 Hs.8207 ESTs 4 

107140 AA620889 Hs.170088 ESTs 6.7

107151 AA621169 Hs.8687 ESTs 19 

107159 AA621340 Hs.10600 ESTs; Weakly similar to ORFYKR081c I 6.1 

107174 AA621714 Hs.25338 ESTs 8.5 

107217 D51095 Hs.35861 DKFZP586E1621 protein 7.2 

107252 D59971 Hs.25925 ESTs 7.9

107295 T34527 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:po 5.6

107299 T40327 Hs30661 lung resistance-related protein 8.4

107324 T81665 Hs.278422 DKFZP586G1122 protein 7.5

107372 U85625 Hs.8297 ribonuclease 6 precursor 4.7

107373 U85773 Hs.154695 phosphomannomutase 2 4.8

107481 W58247 Hs.279766 Homo sapiens kinesin superfamily motor 6.3

107531 Y13936 Hs.17883 protein phosphatase 1G (formerly 2C); ma 8.3

107859 AA024835 Hs.47584 potassium voltage-gated channel; delayed 7.3

107890 AA026030 Hs.61311 ESTs; Weakly similar to CALPAIN 2; U 7.3

107908 AA026894 Hs.42826 ESTs 4.9

108039 AA041341 Hs.46670 ESTs 5.4 

108040 AA041551 Hs.159971 ESTs 8.4 
108102 AA046424 Hs.49433 ESTs; Weakly similar to HYPOTHETICA 6.6 
108217 AA058686 Hs.62568 ESTs 7.7 

108255 AA063157 Hs.172608 ESTs 4

108358 AA071514 Hs.1634 ESTs 4

108609 AA100e94 Hs.69499 Human DNA sequence from BAC 15E1 o 5.5

108647 AA112396 Hs.44276 ESTs; Moderately similar to HOMEOBO 143

108676 AA115562 Hs.274417 Homo sapiens mRNA; cDNA DKFZp664 5.2

108687 AA120785 Hs.54347 ESTs 5.6

108695 AA121315 Hs.70823 KIAA1077 protein 10.5

108733 AA126422 zn84f1.s1 Stratagene lung carcinoma 9372 4.4

108774 AA128125 Hs.71040 ESTs; Moderately similar to CELL GROW 4.6

108828 AA131584 Hs.273344 DKFZP564O0463 protein 5.5

108872 AA134063 Hs.111680 ESTs 7.2 

108884 AA134958 Hs.293591 ESTs 11.3 

108893 AA135894 Hs.194691 retinoic acid induced 3 8.9

109008 AA156360 Hs.87128 ESTs 147

109010 AA156460 Hs.44229 dual specificity phosphatase 12 4.9

109011 AA156542 Hs.72127 ESTs 4.6

109042 AA159525 Hs.71779 Homo sapiens DNA from chromosome 19 7.2

109086 AA166695 Hs.270737 tumor necrosis factor (ligand) superfamily 4

109090 AA167006 Hs.70499 ESTs 5.9

109101 AA167708 Hs.52184 ESTs 4.2

109112 AA169379 Hs.257924 ESTs 4

109150 AA179387 Hs.301997 DKFZP434N126 protein 4

109166 AA179845 Hs.73625 RAB6 interacting; kinesin-like (rabkinesin 13.6

109178 AA181600 Hs.283707 ESTs 11^

109179 AA181902 Hs.192789 ESTs; Weakly similar to !!!! ALU SUBFA 5.4

109261 AA195255 Hs.61779 ESTs 6.7

109270 AA195515 Hs.3585 ESTs; Weakly similar to alternatively spli 4.9

109277 AA196332 Hs.86043 ESTs 5.4

109313 AA206d00 Hs.86276 ESTs; Moderately similar to zinc finger p 5.5

109415 AA227219 Hs.110826 trinucleotide repeat containing 9 20.1

109454 AA232255 Hs.295232 ESTs 47 

109467 AA232904 Hs.63187 ESTs 6.8 

109481 AA233342 Hs.289069 ESTs; Weakly similar to WD40 protein C 10.6

109508 AA233892 Hs.55902 ESTs; Weakly similar to !!!! ALU SUBFA 8

109514 AA234087 Hs.262346 ESTs; Weakly similar to ORF2: function 8.2

109572 F02027 Hs.171937 ESTs 4.8

109632 F04165 Hs.235873 ESTs; Weakly similar to K11C4.2 [C.eleg 6.2

109644 F04477 Hs.291531 ESTs; Moderately similar to GLYCERAL 6.6

109703 F09684 Hs.24792 ESTs; Weakly similar to ORFYOR283w 7.1

109726 F10009 Hs.9196 ESTs 5 

109747 F10161 Hs.22969 ESTs 4.7 

109799 F10770 Hs.180378 Homo sapiens clone 669 unknown mRNA 4.5 

109814 F10979 Hs.1S3106 Homo sapiens clone 23728 mRNA sequen 8.7

110189 H20543 Hs.6278 DKFZP586B1621 protein 16.6

110240 H25577 Hs.176588 ESTs; Weakly similar to CYTOCHROME 6.2

110280 H29285 Hs.32468 ESTs 4.5

110520 H56965 Hs.4082 yj09f06.s1 Soares fetal liver spleen 1NFL 57

110551 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CO 19.5

110707 H95079 Hs.15617 ESTs; Weakly similar to ALU SUBFA 6.2

110734 H98714 Hs.24131 ESTs 30.2 

110770 N22262 Hs.131705 ESTs 5.8 

110780 N23174 Hs.22891 solute carrier family 7 (cationic amino ad 8.2 

110787 N24716 Hs.12244 ESTs; Weakly similar to C44B9.1 [C.eleg 6.7

110794 N25262 Hs.27931 ESTs 5.9 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF127- 4 

110816 N29454 Hs.27552 ESTs; Weakly similar to putative p150[H 4.3 

110839 N30856 Hs.30246 solute carrier family 19 (thiamine transpo 12.8 

110844 N3ig52 Hs.157531 Homo sapiens mRNA full length insert cD 10.1

110854 N32919 Hs.27931 ESTs 4.7

110856 N33063 ESTs; Weakly similar to S164 [H.sapiens 4.2

110860 N33438 Hs.170055 ESTs 12.5

110897 N39148 Hs.5880 DKFZP434D156 protein 4

110915 N46252 Hs.29724 ESTs 232

110935 N48787 Hs.305979 protease inhibitor 1 (anti-elastase); alpha- 4.8

110970 N51374 Hs.96870 Homo sapiens mRNA full length insert cD 5.4

111006 N53375 Hs.166146 Homer; neuronal immediate early gene; 3 4.7

111008 N53388 Hs.7222 ESTs 13.3

111018 N54067 Hs.3628 mitogen-activated protein kinase kinase ki 5.7

111084 N59543 Hs.15456 PDZ domain containing 1 8.3

111100 N62522 Hs.20450 ESTs 14J

111125 N63823 Hs.269115 ESTs 7.9

111132 N64378 Hs.83293 ESTs; Highly similar to unknown function 4.4

111139 N64683 Hs.290943 ESTs 6

111164 N66857 Hs.14808 ESTs; Weakly similar to !!!! ALU CLASS 4.1

111172 N67102 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 5.5 



111178 


N67227 


Hs^4633 


ESTs 


5.7 


111179 


N67239 


Hs.10760 


ESTs 


37 


111181 


N67278 


Hs.171802


ESTs; Weakly similar to hypothetical pro


&6 


111184 


N67437 


Hs.243901 


Homo sapiens mRNA; cDNA DKFZp564 


8.7 


111221 


N68869 


Hs.15119 


ESTs 


7.3 


111223 


N68921 


Hs.297939 


ESTs; Weakly similar to neogenin [H.sap


9 


111229 


N69113 


Hs.110855


ESTs 


8.9 


111241 


N69514 


Hs.288880 


ESTs; Weakly similar to CGI-82 protein I 


6.9 


111268 


N70481 


Hs.26118 


Homo sapiens clone 24766 mRNA sequen


4.5 


111295 


N73275 


Hs.21275 


ESTs; Weakly similar to ubiquitin-conjug 


5.6 


111299 


N73808 


Hs.24936 


ESTs 


8.5 


111336 


N79565 


Hs.29894 


ESTs 


6.7 


111357 


N91023 


Hs.87128


ESTs 


15 


111370 


N92915


Hs.94531


brefeldin A-inhibited guanine nucleotide-e


5.2 


111806 


R33468 


Hs^79008 


ESTs 


10 


111825 


R35685 


Hs.28614d 


stromal antigen 1 


4.5 


111836 


R36228 


Hs.25119 


ESTs 


7.2 


111890 


R38678 


Hs.12365 


ESTs 


17.3 


111923 


R39995 


Hs.25925 


Homo sapiens clone 23860 mRNA sequen


7.3 


111942 


R40576 


Hs.21590 


ESTs 


9.2 


111987 


R42036 


Hs.6763 


KIAA0942 protein


10.6 


112101 


R44793 


Hs.295341 


adenylyl cyclase-associated protein 2


5.3 


112134 


R46025 


Hs.7413


ESTs 


17.4 


112197 


R49482 


Hs.5637


ESTs 


4.4 


112244 


R51309 


Hs.70823 


KIAA1077 protein 


11 


112253 


R51818 




Homo sapiens mRNA; cDNA DKFZp666 


9.3 


112305 


R54822 


Hs.26244 


ESTs 


4.4 


112449 


R63802 


Hs.124186 


ring finger protein 2 


6.3 


112483 


R66534 


Hs.285885 


ESTs 


4.9 


112519 


R66631 


Hs.11861 


ESTs 


14.3 


112610 


R79392 


Hs.23643


ESTs 


5.2 


112693 


R88741 


Hs.91065 


ESTs; Moderately similar to proliferation


4.6 


112751 


R93507


Hs.8207 


ESTs 


5.6 


112801 


R97486 


Hs.157160 


protein kinase; DNA-activated; catalytic p


3.7 


112869 


T03313 


Hs.4747 


dyskeratosis congenita 1; dyskerin 


5.9 


112871 


T03352 


Hs.12285


ESTs 


5.8 


112908 


T10065 


Hs.3530 


TLS-associated serine-arginine protein


4.1 


112966 


T17119 


HS.10254B 


glucocorticoid receptor DNA binding fact


5.7 


112971 


T17185 


Hs.83883 


ESTs 


6.4 


112995 


T23528 


Hs.7155 


ESTs; Weakly similar to TYKi protein (M 


9.1 


113047 


T25867 


Hs.7549 


ESTs 


5.4 


113075 


T34660 


Hs.5986 


ESTs; Weakly similar to !!!! ALU SUBFA


5.7 


113117 


T47819 


Hs.159153 


ESTs 


5.8 


113206 


T58044 


Hs.241471 


ESTs; Moderately similar to !!!! ALU SU


6.4 


113248 


T63857 




yc16eU1 Stratagene lung (#93721) Homo 


4.6 


113260 


T64896 


Hs.287420


ESTs 


6.9 


113277 


T65797 


Hs.11774 


protein (peptidyl-prolyl cis/trans isomeras


5.6 


113278 


T65802 


Hs.11135 


yc11h10.s1 Stratagene lung (#937210) Ho 


4.5 


113440 


T86121 


Hs.191445 


ESTs 


6.4 


113523 


T90037 


Hs.95549 


ESTs 


6.4 


113604 


T92735 


Hs.296083 


ESTs 


8.7 


113702 


T97307 




ESTs; Moderately similar to !!!! ALU SU


9.5 


113783 


W19222 


Hs.7041 


ESTs; Weakly similar to !!!! ALU SUBFA


5.2 


113794 


W37382 


Hs.11090 


ESTs 


11.9 


113808 


W44735 


Hs.9286 


ESTs 


16.7 


113811 


W44928 


Hs.6994 


ESTs 


4 


113822 


W47350 


Hs.17466


retinoic acid receptor responder (tazaroten


4.8 


113823 


W47388 


Hs.55099 


rab6 GTPase activating protein (GAP and


4 


113836 


W56792 


Hs.12040 


ESTs; Weakly similar to KIAA0881 prote


4.1 


113857 


Vy85477 


Hs.52g7 


Homo sapiens mRNA; cDNA DKFZp664


4.3 


113886 


W72471 


Hs.23920 


ESTs 


4.6 


113885 


W73738 


Hs.12921 


ESTs 


7.1


113923 


W80763 


Hs.3849 


ESTs; Weakly similar to FK506-binding p


6.8 


113931 


W81205 


Hs.3496 


ESTs 


6.1 


113950 


W85785 


Hs.30504 


Homo sapiens mRNA; cDNA DKFZp434 


14 


113970 


W86748 


Hs.8109 


ESTs 


15 


114051 


W94942 


Hs.177534 


dual specificity phosphatase 10


5.4 
114057 W96222 Hs.34192 ESTs 4.8

114086 Z38266 Hs.288649 Homo sapiens PAC clone DJ0777O23 fro 5.1

114098 Z38347 Hs.118338 ESTs; Weakly similar to similar to S. cere 6.2

114109 Z38435 Hs.184108 ribosomal protein L21 4.6

114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0885 prote 22

114138 Z38763 Hs.15740 amyloid beta (A4) precursor protein-bindi 8.8

114149 Z38814 Hs.27196 ESTs 4

114162 Z38909 Hs.22265 ESTs 7.2

114177 Z39052 Hs.23740 ESTs 5.3

114196 Z39211 Hs.150926 fucose-1-phosphate guanylyltransferase 4.4

114208 Z39301 Hs.7859 ESTs 5.1

114250 Z39897 Hs.13297 ESTs 7.2

114251 Z39898 Hs.21948 ESTs 14.7

114292 Z40715 Hs.184641 deltas fatty acid desaturase 19.4

114297 Z40758 Hs.173091 DKFZP434K151 protein 8.9

114334 Z41342 Hs.22941 ESTs 13.7 

114460 AA024604 Hs.26102 ESTs 10.1 

114471 AA028074 Hs.104613 ESTs 5.7 

114480 AA032243 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:po 7.3

114518 AA046407 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like 1 4.3

114542 AA055768 Hs.293380 ESTs 117 

114549 AA056484 Hs.292833 ESTs 7.3 

114652 AA101416 Hs.107149 ESTs; Weakly similar to PTB-ASSOCIAT 6.1 

114673 AA113303 Hs.95583 transmembrane 4 superfamily member (te 4.3 

114698 AA126951 Hs.110857 ESTs; Highly similar to putative DNA-dir 7.1

114767 AA148885 Hs.154443 minichromosome maintenance deficient (S 5.3

114799 AA159323 Hs.109929 ESTs 4.2

114804 AA160363 Hs.269956 ESTs 4.8

114811 AA161161 Hs.95907 multiple inositol polyphosphate phosphata 7.1

114821 AA165313 Hs.55468 ESTs 4.4

114852 AA235035 Hs.38260 ESTs; Moderately similar to ubiquitin spe 5

114901 AA236276 Hs.196437 ESTs; Weakly similar to R26660_1; parti 16.9

114902 AA236359 Hs.39504 ESTs 5.1 
114940 AA243012 Hs.75928 ESTs 8.5 

114965 AA250737 Hs.72472 ESTs 35.1

115047 AA252627 Hs.82916 homeoboxBS 5.7 

115054 AA252863 Hs.87729 ESTs 6.2 

115061 AA253217 Hs.41271 ESTs 13

115082 AA255557 Hs.19B269 NADH dehydrogenase (ubiquinone) 1 alp 28.2

115116 AA256486 Hs.62275 ESTs 8.8

115140 AA258030 Hs.279938 ESTs; Weakly similar to supported by GE 4.1 

115205 AA262470 Hs.284216 ESTs 8.3 

115206 AA262491 Hs.186572 ESTs 5.1 
115239 AA278650 Hs.73291 ESTs; Weakly similar to similar to the bet 4.6 

115242 AA278755 Hs.283732 ESTs 8.3

115249 AA278961 Hs.71124 ESTs 10.1

115259 AA279071 Hs.13453 splicing factor 3b; subunit 1; 155kD 9.5

115285 AA279799 Hs.293736 ESTs 5.8

115291 AA279943 Hs.122579 ESTs 5.1

115357 AA281793 Hs.72988 ESTs 5

115377 AA282247 Hs.193063 ESTs 6.1 

115400 AA283198 Hs.89113 ESTs 4.9 

115439 AA284561 Hs.193090 ESTs 5.8 

115471 AA287138 Hs.59346 ESTs; Weakly similar to ASPARTYL-TR 11.7 

115506 AA292537 Hs.45207 Human DNA sequence from clone 620E1 6.8

115522 AA331393 Hs.47378 ESTs 5.8 

115572 AA398392 Hs.59594 ESTs; Weakly similar to F33G12.3 gene p 9.7 

115587 AA399264 Hs.283037 ESTs; Highly similar to HSPC039 protein 8.7 

115600 AA400247 Hs.42173 ESTs 4

115612 AA400948 Hs.71243 ESTs; Weakly similar to zinc finger prote 8.4

115646 AA404352 Hs.305971 ESTs 5.3 

115652 AA405098 Hs.38178 ESTs 16.1 

115657 AA405620 Hs.55158 ESTs; Weakly similar to weak similarity I 4.7 

115658 AA405625 Hs.183056 Human DNA sequence from clone 34B21 5.1

115675 AA406546 Hs.82065 Homo sapiens mRNA; cDNA DKFZp5M 20.5

115721 AA417102 Hs.90960 ESTs 4.8 

115763 AA421560 ESTs 7 
115764 AA421562 Hs.91011 anterior gradient 2 (Xenopus laevis) homo 41.6

115835 AA428576 Hs.41371 ESTs A2

115844 AA430124 HsJ773 ESTs 115

115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity 33.5

115888 AA435839 HsJ6591 KIAA0887 protein 72

115922 AA441911 HsJ1869 ESTs; Weakly similar to KIAA0926 prote 5.1

115941 AA443602 Hs.46679 ESTs 4.8

115947 AA443793 Hs.94761 ESTs 8.3

115948 AA4437g8 Hs.43445 poly(A)-specific ribonuclease (deadenylat 13^

115951 AA443918 Hs.301048 cofilin 1 (non-muscle) 7.5

115967 AA446887 Hs.42911 ESTs 8.8

115984 AA447687 Hs.91109 ESTs 13.1

116009 AA449448 Hs.44238 ESTs 5.5

116024 AA451748 Hs.83883 Human DNA sequence from clone 718J7 7.5

116028 AA452112 Hs.42544 thioredoxin-like 12.7

116050 AA453656 Hs.88417 ESTs 72

116097 AA456099 Hs.176376 ESTs 11.8 

116108 AA457566 Hs.28777 ESTs 4.5 

116121 AA459254 Hs.48B55 ESTs 4.5 

116127 AA459703 Hs.279884 v-myc avian myelocytomatosis viral onco A3

116129 AA459956 Hs.49163 ESTs; Highly similar to putative ribonucle 7.6

116142 AA460649 Hs.39457 ESTs 43 

116204 AA465701 Hs.108646 ESTs 6.8 

116221 AA478397 Hs.50180 ESTs 45 

116222 AA478415 Hs.89986 ESTs 4

116238 AA479362 Hs.47144 DKFZP586N0819 protein 4.6

116246 AA479961 Hs.250646 ESTs; Highly similar to ubiquitin-conjuga 4

116249 AA480886 Hs.86693 ESTs 18.5

116250 AA480975 Hs.44829 ESTs 10.8

116254 AA481146 Hs.41086 ESTs; Weakly similar to OXYSTEROL- 9.1

116256 AA481256 Hs.88201 ESTs; Weakly similar to lysophospholipa 8.4

116264 AA482594 Hs.272239 Homo sapiens mRNA; cDNA DKFZp586 7.2

116265 AA482595 Hs.55189 ESTs; Weakly similar to F25B53 [C.eleg 11.1

116282 AA486550 Hs.204501 ESTs; Weakly similar to Wiskott-Aldrich 6.2

116298 AA489046 Hs.94109 ESTs 4.9

116300 AA489194 Hs.159471 ESTs; Weakly similar to snRNP protein B* 4.6 

116327 AA490959 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 5.8 

116334 AA491457 Hs.48948 ESTs 4.3 

116337 AA496127 Hs.44070 ESTs 8.4 

116351 AA504116 Hs.82501 Homo sapiens mRNA; cDNA DKFZp434 5.3

116357 AA504806 Hs.90797 Homo sapiens clone 23620 mRNA sequen 5.2

116415 AA609204 Hs.27973 KIAA0874 protein 6.6 

116443 AA620313 Hs.190488 ESTs; Weakly similar to KERATIN; TYP 4.5 

116470 C13992 Hs.83484 ESTs 4.5 

116480 C14088 glyceraldehyde-3-phosphate dehydrogena 5.6

116578 D51272 Hs.75337 nucleolar phosphoprotein p130 4.1

116579 D51276 Hs.81915 leukemia-associated phosphoprotein p18 5.8
116626 F02028 Hs.81907 ESTs 4.9 
116647 F03069 Hs.15395 ESTs; Weakly similar to ARGINYL-TRN 6.1 

116674 F04816 Hs.92127 ESTs 10.6

116680 F08813 Hs.273829 LINE retrotransposable element 1 4.2

116700 F09983 Hs.317589 ESTs 13

116724 F13665 Hs.65641 ESTs 8.5 

116726 F13681 Hs.53913 ESTs 5.6 

116732 F13779 Hs.165909 ESTs 11.6

116734 F13789 Hs.93796 DKFZP586D2223 protein 5.4

116760 H11064 Hs.155342 protein kinase C; delta 4.3 

116780 H22566 HsJ^S ESTs 5.7 

116786 H25836 Hs.301527 tumor necrosis factor (ligand) superfamily 8.8

116787 H28581 Hs.15641 ESTs 8.6

116790 H29532 Hs.101174 microtubule-associated protein tau 22.2

116803 H47357 Hs.109701 ESTs; Moderately similar to weak similar 6.7

116877 H68116 Hs.168732 ESTs 6.5

116921 H72948 Hs.821 biglycan 20.7 

117216 N20083 Hs.42792 ESTs 4.4

117232 N20579 Hs.61153 ESTs 7.4

117284 N22162 Hs.183779 ESTs; Weakly similar to cDNA EST yk33 4.1
117344 N24046 Hs^10706 ESTs 7.4

117367 N24954 Hs.42502 ESTs 10.5

117392 N26175 Hs.93405 ESTs 5.8

117394 N26257 Hs.39871 KIAA0727 protein 8.4

117412 N26722 Hs.42645 ESTs 18.1

117498 N31726 Hs.44258 ESTs; Highly similar to myelin gene expr 5.8

117557 N33920 Hs.44532 diubiquitin

117634 N36421 Hs.13323 ESTs; Weakly similar to SODIUM- AND 4.4

117639 N36923 Hs.44833 ESTs 6

117754 N47469 Hs.59757 ESTs 7.6

117852 N49408 Hs.136102 KIAA0853 protein 5.9

117879 N50050 Hs.303025 ESTs; Weakly similar to keratin; 67K typ 7.9

117924 N51056 Hs.38891 ESTs 7.9

117950 N51394 Hs.75478 KIAA0956 protein 5

117992 N52000 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586 7

118138 N57773 Hs.93560 ESTs; Weakly similar to tig IRnorvegicu 4.8

118215 N62195 Hs.77910 3-hydroxy-3-methylglutaryl-Coenzyme A 13.4

118229 N62339 Hs.166254 heat shock 90kD protein 1; alpha 5.4

118265 N62827 Hs.48645 EST 4.2

118336 N63604 Hs.47166 ESTs 72

118363 N64168 Hs.48938 ESTs 6

118429 N66158 HSJ4649 ESTs 4.1

118470 N66759 Hs^91033 ESTs 5.4

118472 N66818 Hs.42179 ESTs 10.8

118475 N66845 ESTs; Weakly similar to !!!! ALU CLASS 4.5

118493 N57149 Hs.50115 ESTs 5.3

118528 N67889 Hs.49397 ESTs 10.4

118542 N68010 Hs.49427 ESTs 7.9

118600 N69222 ESTs 9.2

118695 N71781 Hs.50081 Homo sapiens mRNA full length insert cD 9.8

118698 N72113 Hs.50187 ESTs 4.3

118901 N90719 Hs.94445 ESTs 8.1

118952 N92986 ESTs; Highly similar to CGI-90 protein [H 12.5

118976 N9362g Hs.93391 ESTs 5

118986 N94362 Hs.125830 ESTs 7.3

118989 N94439 Hs.45105 ESTs S2

119027 N99256 Hs.114611 ESTs 5

119042 R05316 Hs.5472 ESTs 4

119075 R36451 Hs.287820 fibronectin 1 6

119260 T15916 Hs.102950 ESTs; Highly similar to coat protein gamm 4.1

119271 T16387 Hs.65328 ESTs 12.1

119298 T23820 Hs.155478 cyclin T2 5.6

119302 T25725 ESTs 143

119341 T62571 Hs.146388 microtubule-associated protein 7 4

119495 W35390 Hs.55533 ESTs 5.3

119580 W42451 Hs.92260 high-mobility group protein 2-like 1 5.6

119602 W46286 Hs.233694 ESTs; Weakly similar to ZK10S8.5 (C^le 6.5

119620 W47620 Hs.56009 2'-5'oligoadenylate synthetase 3 8.1

119676 W60473 Hs.57787 ESTs 5.5

119717 W69134 Hs.57987 ESTs 4.6

119729 W69747 Hs.94806 KIAA1062 protein 4

119805 W73788 Hs.43213 ESTs 4

119859 W80702 Hs.58461 ESTs 4.8

119867 W80852 Hs.250696 KDEL (Lys-Asp-Glu-Leu) endoplasmic re 4.2

119873 W81129 Hs.44865 Homo sapiens mRNA; cDNA DKFZp586 4.8

119899 W84767 Hs.58698 ESTs 5.9

119940 W86779 Hs.272531 DKFZP586B0319 protein 9

119943 W86835 Hs.14158 copine III 4.6

119970 W87812 Hs.93581 Homo sapiens mRNA; cDNA DKFZp686 4

120131 Z38666 Hs.75887 coatomer protein complex; subunit alpha 4.2

120150 Z39549 Hs.153746 ESTs 11

120208 Z40805 Hs.91668 ESTs 8.2

120241 Z41815 Hs.65946 ESTs 15.6

120255 AAie9752 Hs.5672 ESTs; Weakly similar to Similarity to Yea 4.2

120314 AA194166 Hs.221040 KIAA1038 protein 6.8

120325 AA195651 Hs.104106 ESTs 15.2

120352 AA211400 Hs.193172 ESTs 6.8
120428 AA236822 Hs.173694 KIAA1097 protein 5.6

120524 AA261852 Hs.192905 ESTs 5.6 

120528 AA262107 Hs.104413 ESTs 4.5 

120571 AA280738 Hs.34892 ESTs 4.9 

5 120649 AA287115 Hs.192843 ESTs 4.5 

120655 AA287347 Hs.238205 ESTs 6.7 

120668 AA287833 Hs.292913 ESTs 8.3 

120712 AA292654 Hs.102506 eukaryotic translation initiation factor 2 ai 4.6

120713 AA292655 Hs.96557 ESTs 10.6
120724 AA293470 Hs.100747 ESTs 5.4

120873 AA358015 EST 7.1

120885 AA365515 Hs.301872 ESTs; Moderately similar to !!!! ALU SU 4.6

120919 AA381125 Hs.301444 ESTs 8.2 

120948 AA397822 Hs.ia4650 ESTs; Highly similar to similar to magon 8.6 

120969 AA398116 Hs.129206 casein kinase 1; gamma 3 10.5

120977 AA398155 Hs.97600 ESTs 10.9 

121103 AA398936 Hs.97697 EST 7.4 

121291 AA401753 Hs.8186 lung cancer candidate 5.3 

121320 AA403008 Hs.301927 T-cell receptor; alpha (V;D;J;C) 13.5

121463 AA411745 Hs.239681 ESTs; Weakly similar to KIAA0554 prote 8.9

121596 AA416740 Hs.174104 ESTs 22.6 

121723 AA419622 Hs.104800 ESTs; Weakly similar to Mouse 19.5 mRN 8 

121748 AA421171 Hs.234545 ESTs 5.6 

122125 AA434411 Hs.98806 ESTs 5.3 

25 122522 AA449444 Hs.98969 ESTs 4 

122655 AA454756 Hs.97837 ESTs 4 

122704 AA456326 Hs.99445 ESTs 6.2 

122782 AA459894 Hs.99472 ESTs 5.3

122856 AA463740 Hs.75367 Src-like-adapter 13.1

122882 AA466381 Hs.108d12 ESTs; Weakly similar to B0041.5 [C.eleg 5.5

122928 AA47657d Hs.101840 ESTs 6.3 

122974 AA478625 Hs.194215 ESTs 6 

122997 AA479295 Hs.106290 Kelch motif containing protein 12.5 

123016 AA480103 Hs.323231 ESTs; Weakly similar to alternatively spli 4.4

123107 AA486071 Hs.104207 ESTs 8.3

123111 AA486273 Hs.191721 ESTs 4.2 

123114 AA486407 Hs.129928 ESTs; Moderately similar to KIAA0454 p 5.2 

123136 AA487449 Hs.194024 ESTs 4.2

123137 AA48746B Hs.100686 ESTs; Weakly similar to secreted cement 14.6
123169 AA488892 ESTs; Weakly similar to Gag-Pol polypro 4.5

123176 AA489020 Hs.69233 ESTs 5.2 

123338 AA504249 Hs.187585 ESTs 4 

123436 AA598714 Hs.223014 protease; serine; 15 7.3 

123442 AA598803 Hs.111496 ESTs 5.9 

123449 AA598899 Hs.112493 Homo sapiens mRNA; cDNA DKFZp564 4.1

123494 AA599786 Hs.112110 ESTs 4

123503 AA600121 Hs.293156 ESTs 12.8

123533 AA608751 ESTs; Weakly similar to !!!! ALU SUBFA 7.9

123619 AA509200 ESTs 23.1

123673 AA509471 Hs.158549 ESTs 6.6

123729 AA609778 Hs.278672 membrane component chromosome 11; s 4.7 

123819 AA620636 Hs.112264 ESTs 4 

123960 AA621785 Hs.287733 methylmalonate-semialdehyde dehydroge 7.6

124000 D57317 Hs.74a61 activated RNA polymerase II transcription 4.4

124005 D60302 Hs.270016 ESTs 20.6

124012 D80240 Hs.241471 HUM5G11A Human fetal brain (TFujiwa 6.7 

124021 F02859 Hs.13974 ESTs 4.7 

124049 F10523 Hs.74519 primase; polypeptide 2A (58kD) 4.7

124059 F13673 Hs.283713 ESTs 7.7 

124243 H66710 Hs.133525 ESTs 5.5

124308 H93575 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 11.4

124314 H94877 Hs.215766 GTP-binding protein 13.7

124315 H94892 Hs.28d757 v-ral simian leukemia viral oncogene hom 14
124350 N21359 Hs.101282 Homo sapiens mRNA; cDNA DKFZp434 6.6

124352 N21626 Hs.102406 ESTs 7.2

124357 N22401 yw37g07.s1 Morton Fetal Cochlea Homo 5.2

124390 N29325 Hs.7535 ESTs; Highly similar to COBW-like place 7.9
124438 N40188 Hs.11090 ESTs

124447 N48000 Homo sapiens mRNA; cDNA DKFZp586

124457 N50114 Hs.2G6175 ESTs

124539 N63172 Hs.146409 cell division cycle 42 (GTP-binding prote

124626 N74604 Hs.11090 ESTs

124632 N79515 Hs.306117 interleukin 13 receptor alpha 1

124644 N91279 Hs.109654 ESTs; Moderately similar to outer membr

124675 R01037 Hs.181013 phosphoglycerate mutase 1 (brain)

124677 R01073 ESTs; Weakly similar to !!!! ALU CLASS

124724 R12405 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586

124773 R40923 Hs.108604 ESTs

124777 R41933 ESTs

124792 R44357 Hs.48712 ESTs; Weakly similar to cDNA EST EMB

124857 R63652 Hs.137190 ESTs

124911 R88992 Hs.180612 ESTs

124955 T10598 Hs.324841 ESTs; Weakly similar to !!!! ALU SUBFA

124958 T11134 Hs.431 murine leukemia viral (bmi-1) oncogene h

125038 T78089 Hs.270134 ESTs

125092 T92544 Hs.137546 CD84 antigen (leukocyte antigen)

125132 W15495 Hs.129781 chromosome 21 open reading frame 5

125144 W37999 Hs.24336 ESTs

125154 W38419 ESTs

125243 W86423 Hs.105413 ESTs

125279 W93640 Hs.4779 ESTs; Moderately similar to similar to AD

125299 Z39436 Hs.102720 ESTs

125303 Z39821 Hs.288193 ESTs

125304 Z39833 Hs.124940 GTP-binding protein

125474 AA151216 Hs.75103 tyrosine 3-monooxygenase/tryptophan 5-m

125509 AA044232 Hs.288967 ESTs

125580 AA126504 Hs.267612 sorting nexin 4

125582 AA507383 Hs.74649 cytochrome c oxidase subunit VIc

125670 AI432621 Hs.82685 CD47 antigen (Rh-related antigen; integri

125698 AA748483 Hs.191356 general transcription factor IIH; polypepti

125745 AI283493 Hs.75722 ribophorin II

125852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564

125972 AA434562 Hs.35406 ESTs

126160 N90950 Hs.265398 ESTs; Weakly similar to transformation-r

126257 N99638 Hs.124084 tumor necrosis factor receptor superfamily

126337 AI066486 Hs.40500 similar to S. cerevisiae RER1

126405 U46278 Hs.122489 ESTs

126537 W40262 Hs.146310 ESTs; Weakly similar to putative p150 [H

126590 W78958 Hs.181307 H3 histone; family 3A

126712 AA205862 Hs.7942 ESTs

126721 T72569 Hs.125359 Thy-1 cell surface antigen

126764 AI334393 Hs.102178 ESTs

126804 AI203334 Hs.160628 ESTs

126819 AA305536 Hs.279607 ESTs

126877 AI052047 Hs.26102 ESTs

126991 R31652 Hs.821 biglycan

127479 AA513722 Hs.179729 collagen; type X; alpha 1 (Schmid metaph

127514 AA826926 Hs.204214 ESTs

127663 W07286 Hs.10340 ESTs; Weakly similar to weak similarity t

127677 AA916752 Hs^64190 ESTs; Highly similar to MEM3 (M.muscu

127814 AA761755 Hs.136713 ESTs; Weakly similar to V4-1 [H.sapiens

127997 AI281549 Hs.311054 ESTs

128092 AA904617 Hs.166229 ESTs

128218 H02682 Hs.292154 ESTs; Moderately similar to recombinatio

128466 D59653 Hs.241471 EST

128482 U83908 Hs.296251 programmed cell death 4

128517 AA280617 Hs.100861 ESTs; Weakly similar to p60 katanin [H^

128530 AA504343 Hs.183475 Homo sapiens clone 25061 mRNA sequen

128559 AA226801 Hs.101448 metastasis associated 1

128574 AA412048 Hs.38260 keratin 8

128595 U31875 Hs.152677 short-chain alcohol dehydrogenase family

128610 L38608 Hs.10247 activated leucocyte cell adhesion molecule

128629 AA399187 Hs.102708 DKFZP434A043 protein

128649 AA142853 Hs.103106 Homo sapiens mRNA for 67b protein (G
128651 AA446990 Hs.103135 ESTs 6.1

128653 R48943 Hs.10315 solute carrier family 7 (cationic amino aci 4.4

128656 AA458542 Hs.10326 coatomer protein complex; subunit epsilon 14.3

128717 T30617 Hs.104222 Homo sapiens mRNA; cDNA DKFZp566 24.5

128727 M64174 Hs.50651 Janus kinase 1 (a protein tyrosine kinase) 7.3

128764 N49308 Hs.104938 ESTs; Weakly similar to alpha 1(XVIII)c 9.2

128781 X85372 Hs.105465 small nuclear ribonucleoprotein polypepti 5.4

128793 W93562 Hs.105749 KIAA0553 protein 4.6 

128835 W15528 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586 4 

128845 AA455658 Hs.10649 basement membrane-induced gene 6.9

128871 AA400271 Hs.106778 Homo sapiens mRNA for putative Ca2-Kt 4.5

128922 AA252023 Hs.9589 ESTs; Weakly similar to HRIHFB2157 [H 6.4

128925 D61676 Hs.21651 Homo sapiens mRNA; cDNA DKFZp586 6.4

128938 AA410325 Hs.107260 ESTs 7

128946 N29353 Hs.107318 kynurenine 3-monooxygenase (kynurenin 52

128948 AA485655 Hs.223025 proteasome (prosome; macropain) subunit 13.1

128955 F10290 Hs.185807 Homo sapiens clone 24758 mRNA sequen 5.8

129005 AA460049 Hs.13323 ESTs; Weakly similar to SODIUM- AND 12.6 

129009 AA131421 Hs.75607 ESTs 9.8 

20 129017 H13108 Hs.107968 ESTs 13.9 

129057 X62466 Hs.276770 CDW52 antigen (CAMPATH-I antigen) 10.7 

129075 AA129465 Hs.83765 ESTs 4.7 

129095 L12350 Hs.108623 thrombospondin 2 4.4

129124 AA234530 Hs.108802 N-ethylmaleimide-sensitive factor 20.7

25 129160 AA131252 Hs.109007 ESTs 5.9 

129164 AA282183 Hs.109045 ESTs 5.8 

129180 R40556 Hs.318401 ESTs; Highly similar to HSPC039 protein 7.6 

129224 X89109 Hs.109606 coronin; actin-binding protein; 1A 12

129229 AA211941 Hs.109643 polyadenylate binding protein-interacting 7.9

129240 W24360 Hs.237868 interleukin 7 receptor 5.3

129241 AA435665 Hs.109706 ESTs; Moderately similar to HN1 [M.mus 8.4

129243 H88033 Hs.109727 KIAA0733 protein 7.8

129247 AA151574 Hs.109733 pilin-like transcription factor 6.4

129259 AA090695 Hs.181385 ESTs 6.2

129270 Z35227 Hs.109918 ras homolog gene family; member H 5.4

129281 AA026318 Hs.289101 glucose regulated protein; 58kD 4.4

129300 C20976 Hs.110165 ESTs; Highly similar to ribosomal protein 5.7

129318 N93155 Hs.285976 calmodulin 1 (phosphorylase kinase; delta 7.7

129319 AA037467 Hs.30340 ESTs 6
129351 AA167268 Hs.62349 Human ras inhibitor mRNA; 3' end 9.3

129366 H18027 Hs.184697 plexin C1 18.2

129383 W92984 Hs.288224 ESTs 5.9 

129388 AA151621 Hs.110964 ESTs 4.1 

129391 T80814 Hs.11101 discs; large (Drosophila) homolog 3 (neur 105 

129404 AA172056 Hs.317584 ESTs 5.3

129406 N23707 Hs.111138 KIAA0712 gene product 4

129426 AA412087 Hs.111323 EST; Highly similar to protein inhibitor o 8

129453 AA421213 Hs.111632 Lsm3 protein 5.5

129513 C00225 Hs.306153 ESTs; Weakly similar to fos39554_1 (H.S 5.5

129519 AA298786 Hs.112242 ESTs 6.8

129606 R21443 Hs.166254 heat shock 90kD protein 1; alpha 5

129522 AA278243 Hs.323949 ESTs 6.8

129626 AA447410 Hs.111334 ESTs; Weakly similar to !!!! ALU SUBFA 5.1

129627 AA258308 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564 5.3
129628 U26727 Hs.1174 cyclin-dependent kinase inhibitor 2A (mel 8.2

129642 R50008 Hs.11806 7-dehydrocholesterol reductase 43

129663 AA442768 Hs.11866 translocase of inner mitochondrial membr 4.4

129665 M88458 Hs.118778 KDEL (Lys-Asp-Glu-Leu) endoplasmic re 4

129691 X06700 Hs.119571 collagen; type III; alpha 1 (Ehlers-Danlos 6

129783 AA454618 Hs.12479 associated molecule with the SH3 domain 6.4

129800 AA252436 Hs.12540 lysophospholipase I 7.7

129836 AA452161 Hs.206521 YME1 (S.cerevisiae)-like 1 5

129850 N20593 Hs.288932 GDP dissociation inhibitor 2 6.9

129869 AA102520 Hs.13015 ESTs; Weakly similar to heat shock prote 5

129896 AA043021 Hs.13225 UDP-Gal:betaGlcNAc beta 1;4-galactosy 6.6

129982 M87789 immunoglobulin gamma 3 (Gm marker) 4

129985 AA450045 Hs.140452 cargo selection protein (mannose 6 phosp 5.8
130029 AA236412 Hs.236510 ESTs; Moderately similar to PFT27 [M.m 5.6

130033 M90696 Hs.181301 cathepsin S 5.4

130036 AA195260 Hs.125849 ESTs; Moderately similar to ALU SU 7.4

130069 AA055896 Hs.146428 collagen; type V; alpha 1 7.6

130077 T24055 Hs.91379 ribosomal protein L26 4

130080 X14850 Hs.147097 H2A histone family; member X 12.1 

130096 AA223874 Hs.197955 KIAA0704 protein 5 

130114 AA234717 Hs.14992 ESTs 7.8 

130125 M36803 Hs.1504 hemopexin 7.2 

130135 M61764 Hs.21635 tubulin; gamma 1 5.6

130170 AA610070 Hs.151469 calcium/calmodulin-dependent serine pro 7.5

130169 D43947 Hs.151761 KIAA0100 gene product 6.4

130208 AA620556 Hs.15250 peroxisomal D3:D2-enoyl-CoA isomerase 6.4

130211 D50840 Hs.23703 UDP-glucose ceramide glucosyltransferas 4.5

15 130235 X14046 Hs.153053 CD37 antigen 9.1 

130276 S75295 Hs.169149 karyopherin alpha 1 (importin alpha 5) 6.6 

130280 L13738 Hs.153937 activated p21cdc42Hs kinase 5 

130313 AA620323 Hs.154320 ubiquitin-activating enzyme E1C (homolo 6.1

130314 D86967 Hs.154332 KIAA0212 gene product 10
130328 AA135673 Hs.154668 KIAA0391 gene product 6.1

130356 X84373 Hs.155017 nuclear receptor interacting protein 1 10.6

130367 Z38501 Hs.8768 ESTs; Weakly similar to !!!! ALU SUBFA 8.3

130378 T47333 Hs.155188 TATA box binding protein (TBP)-associa 7.1

130384 X66354 Hs.166071 cyclin-dependent kinase 5 5.6

130393 D13630 Hs.155291 KIAA0005 gene product 4.1

130399 AA449417 Hs.155356 Homo sapiens mRNA for putative glucosy 4.6 

130407 N29888 Hs.155410 ESTs 7

130414 M21121 Hs.241392 small inducible cytokine A5 (RANTES) 4.1

130417 U58522 Hs.155485 huntingtin-interacting protein 2 7.9

130421 D21260 Hs.178710 clathrin; heavy polypeptide-like 2 4

130441 U35835 Hs.155637 protein kinase; DNA-activated; catalytic p 6.8

130455 X17059 Hs.155956 N-acetyltransferase 1 (arylamine N-acetyl 26.4

130498 L38951 Hs.180446 karyopherin (importin) beta 1 4.8

130499 AA416723 Hs.158286 Homo sapiens mRNA for KIAA0446 prot 6.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pseud 8.3

130553 AA430032 Hs.252587 pituitary tumor-transforming 1 7.5

130558 H96654 Hs.15984 ESTs; Weakly similar to gene pp21 protei 5.6 

130568 AA232535 Hs.16085 ESTs; Highly similar to CGM3 protein [H 4 

130583 W24957 Hs.293907 ESTs; Moderately similar to similar to C.e 13.3 

130585 H66211 Hs.16331 ESTs 10.1

130604 X03635 Hs.1657 estrogen receptor 1 39.9

130614 AA132007 Hs.16697 ESTs 5.1

130619 AA477739 Hs.12532 ESTs 5.9 

130622 AA235247 Hs.16846 ESTs; Weakly similar to cytochrome P45 4.1 

130625 F0396g Hs.260720 matrix metalloproteinase 2 (gelatinase A; 8.3

130627 L23808 Hs.1695 matrix metalloproteinase 12 (macrophage 10.3

130629 M60346 Hs.1697 ATPase; H+ transporting; lysosomal (vacu 7

130635 M87503 Hs.1706 interferon-stimulated transcription factor 5.5

130639 D59711 Hs.17132 ESTs 7.2

130677 H17861 Hs.17767 ESTs 13.5

130681 D82808 Hs.17820 Rho-associated; coiled-coil containing pro 6

130693 AA487202 Hs.17962 ESTs 6.1 

130703 N63295 Hs.18103 ESTs 4.3 

130706 AA488843 Hs.201673 cornichon-like 4

130712 AA292056 Hs.279762 adenylate cyclase 7 5.1

130714 xg2896 Hs.18212 DNA segment on chromosome X (unique) 8.4

130715 T98227 Hs.171952 occludin 5.7
130744 AA203527 Hs.18747 POP7 (processing of precursor; S.cerevis 6.2
130747 AA471293 Hs.6879 ESTs 8.2

130751 AA435633 Hs.18879 Homo sapiens clone 23965 mRNA sequen 8.3

130796 R39390 Hs.19525 ESTs 4.5

130800 AA223386 Hs.19574 ESTs; Weakly similar to katanin p80 subu 7.7

130855 AA425439 Hs.143323 putative DNA/chromatin binding motif 43

130859 AA287327 Hs.20478 ceroid-lipofuscinosis; neuronal 2; late infa 9.8

130866 M58028 Hs.2055 ubiquitin-activating enzyme E1 (A1S9T a 4.3

130880 D14678 Hs.20830 kinesin-like 2 4.5

130891 D31891 Hs.20991 SET domain; bifurcated; 1 4
130905 AA056489 Hs.129998 ESTs

130913 W03592 Hs.21198 translocase of outer mitochondrial membr

130919 AA291710 Hs.21276 collagen; type IV; alpha 3 (Goodpasture a

130921 AA074596 Hs.194688 bromodomain adjacent to zinc finger dom

130944 M97935 Hs.21486 signal transducer and activator of transcrip

130974 X57985 Hs.2178 H2B histone family; member Q

130987 R45698 Hs.21893 ESTs; Weakly similar to cAMP inducible

130999 N48963 Hs.21992 KIAA0689 protein

131010 AA435748 Hs.169341 ESTs; Weakly similar to phosphatidic acid

131046 X02530 Hs.2248 small inducible cytokine subfamily B (Cy

131091 T35341 Hs.22880 ESTs; Highly similar to dipeptidyl peptid

131153 H11760 Hs.23606 ESTs

131185 M25753 Hs.23960 cyclin B1

131200 AA609427 Hs.293732 ESTs; Moderately similar to !!!! ALU SU

131206 AA04407d Hs.24210 ESTs

131210 AA430047 Hs.95549 ESTs

131227 AA429472 Hs.236522 DKFZP434P106 protein

131244 D38076 Hs.24763 RAN binding protein 1

131245 AA620599 Hs.24766 DKFZP564E1962 protein

131257 AA256042 Hs.24908 ESTs

131319 U25997 Hs.25590 stanniocalcin

131339 AA463450 Hs^5812 Nijmegen breakage syndrome 1 (nibrin)

131388 R34531 Hs.92200 KIAA0480 gene product

131410 H84658 Hs.279836 ESTs

131472 AA608962 Hs.27258 calcyclin binding protein

131475 Z39053 Hs.27263 ESTs

131501 AA121127 Hs.8207 H3 histone; family 3A

131514 X02152 Hs.2795 lactate dehydrogenase A

131524 N391ffl Hs.301804 ESTs

131528 D60856 Hs.28309 UDP-glucose dehydrogenase

131544 N33236 Hs.28555 ESTs; Weakly similar to B0511.8 (C.eleg

131557 D30946 Hs.28707 signal sequence receptor; gamma (transloc

131562 U90551 Hs.28777 H2A histone family; member L

131564 AA491465 Hs.28792 ESTs

131586 AA235385 Hs.26966 ESTs; Moderately similar to alternatively

131587 M15182 Hs.183868 glucuronidase; beta

131589 U52100 Hs.29191 epithelial membrane protein 2

131615 D14533 Hs.192803 xeroderma pigmentosum; complementatio

131664 AA136126 Hs.30327 mitogen-activated protein kinase-activated

131679 AA136660 Hs.30579 ESTs

131684 U26174 Hs.3066 granzyme K (serine protease; granzyme 3

131687 L11066 Hs.3069 heat shock 70kD protein 9B (mortalin-2)

131689 AA5g9653 Hs.30696 transcription factor-like 5 (basic helix-loo

131893 W60913 Hs.110796 ESTs; Weakly similar to cDNA EST yk45

131710 AA233225 Hs.30985 MRS1 protein

131716 D49738 Hs.31053 cytoskeleton-associated protein 1

131742 D31352 Hs.31433 ESTs

131762 H46831 Hs.107767 ESTs; Moderately similar to CaM-KII inh

131781 AA460450 Hs.31989 DKFZP586G1722 protein

131795 N32724 Hs.32317 Sox-like transcriptional factor

131809 L76517 Hs.3260 presenilin 1 (Alzheimer disease 3)

131814 AA437226 Hs.157 interleukin 10 receptor; alpha

131838 AA091932 Hs.180628 dynamin-like protein

131877 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD)

131885 AA044095 Hs.3402 ESTs

131891 AA158258 Hs.30376 heterogeneous nuclear protein similar to r

131925 AA248470 Hs.183180 ESTs; Weakly similar to RING finger pro

131930 AA205460 Hs.69476 ESTs

131941 D62657 Hs.35086 ubiquitin-specific protease 1

131965 W90146 Hs.35962 ESTs

131970 D86960 Hs.3610 KIAA0205 gene product

131971 R70167 Hs.154938 ESTs

131974 AA410424 Hs.268122 Homo sapiens mRNA; cDNA DKFZp586

131977 F09788 Hs.3622 procollagen-proline; 2-oxoglutarate 4-diox

131994 AA479515 Hs.279882 Human DNA sequence from clone 703H1

131997 D82399 Hs.136644 Homo sapiens clone 23714 mRNA sequen

132017 W67251 Hs.267659 Homo sapiens vav 3 oncogene (VAV3) m
132021 T68246 Hs.306079 chaperonin containing TCP1; subunit 5 (e 5.2

132065 D82226 Hs.211594 proteasome (prosome; macropain) 26S sub 8.5

132085 D44466 Hs.38d7 proteasome (prosome; macropain) 26S sub 13^

132089 AA131971 Hs.39122 ESTs 4.8 

132109 AA599801 Hs.40098 ESTs 6.2

132143 AA257056 Hs7972 KIAA0871 protein 14.6

132149 T10822 Hs.324743 ESTs 5.3

132153 N90141 Hs.41066 ESTs; Moderately similar to ELONGATI 9.2

132160 AA281770 Hs.295923 seven in absentia (Drosophila) homolog 1 5.5

132164 U84573 Hs.41270 procollagen-lysine; 2-oxoglutarate 5-dioxy 8.1

132180 AM05569 Hs.418 fibroblast activation protein; alpha; sepras 15.4

132183 L19183 Hs.199895 hypothetical protein 12.2

132225 AA128980 ESTs 5.6 

132227 AA412620 Hs.4248 ESTs 6.7 

15 132235 F09058 Hs.42656 ESTs 6.2 

132255 AA608856 Hs.431 murine leukemia viral (bmi-1) oncogene h 6

132298 N41849 HsJ120 Homo sapiens cytokine receptor related p 5.6 

132314 AA285290 Hs.44499 small EDRK-rich factor 2 6.8 

132325 N37065 Hs.44856 ESTs 4.7 

132384 AA479933 Hs.46967 Human DNA sequence from clone 167A1 4.2

132387 R70914 Hs.281434 heat shock 70kD protein 1 9.1

132393 W85888 Hs.47334 ESTs; Moderately similar to !!!! ALU SU 4

132405 F09979 Hs.4774 ESTs 15 

132407 AA431459 Hs.47783 ESTs 8 

132413 AA132969 Hs.260116 KIAA1104 protein 4

132446 AA426218 Hs.48764 ESTs 5.3 

132465 AA047896 Hs.49169 ESTs 15.4 

132482 AA429478 Hs.238126 ESTs; Highly similar to CGI-49 protein [H 9

132492 T03749 Hs.4990 KIAA1089 protein 8.5

132528 AA283006 Hs.50758 chromosome-associated polypeptide C 4.3

132540 AA488987 Hs.5097 synaptogyrin 2 9.8

132543 AA417152 Hs.5101 protein regulator of cytokinesis 1 10.1 

132580 L37042 Hs.283738 casein kinase 1; alpha 1 5.9 

132586 AA412452 Hs.52515 DKFZP434N024 protein 4.2 

132608 AA199588 Hs.5321 ARP3 (actin-related protein 3; yeast) hom 4.2

132616 AA386264 Hs.283558 isocitrate dehydrogenase 2 (NADP+); mit 5.2

132617 AA171913 Hs.5338 carbonic anhydrase XII 10.1

132618 AA253330 Hs.279916 adaptor-related protein complex 1; gamma 4.8
132640 U33821 Hs.5437 Tax1 (human T-cell leukemia virus type I 5.7

132668 AA453614 Hs.5460 KIAA0776 protein 4.4

132694 M60830 Hs.5509 ecotropic viral integration site 2B 15.6

132700 N47109 Hs.5521 ESTs 7

132724 AA417962 Hs.55498 geranylgeranyl diphosphate synthase 1 5.6 

132738 W42674 Hs.264636 ESTs; Moderately similar to neuronal thre 4.9 

132742 AA490862 Hs.292812 ESTs; Weakly similar to C43H8.1 [C.eleg 7.9

132744 X54326 Hs.55921 glutamyl-prolyl-tRNA synthetase 4.1

132795 H99152 Hs.57079 ESTs 8

132807 AA331777 Hs.57301 mutL (E. coli) homolog 1 (colon cancer; n 8

132811 U25435 Hs.57419 transcriptional repressor 4

132817 AB004884 Hs.57553 tousled-like kinase 2 6.5

132840 N23817 Hs.5807 Homo sapiens clone 23675 mRNA sequen 5.6

132845 D62588 Hs.5813 ESTs 12.4

132847 T48195 Hs38189 eukaryotic translation initiation factor 3; s 7

132856 W79865 Hs.58367 glypican 4 6.2

132869 N26855 Hs.203961 ESTs 6.5

132874 AA425776 Hs.58609 ESTs 5.6

132880 AA4443e9 Hs.177537 ESTs 7.2

132894 D82422 Hs.5944 ESTs 7.5

132900 N56451 Hs.5g78 LIM domain only 7 4.4

132903 AA235404 Hs.5985 Homo sapiens clone 25186 mRNA sequen 9.1

132904 X83618 Hs.59889 3-hydroxy-3-methylglutaryl-Coenzyme A 10.7

132906 AA142857 Hs.234896 ESTs; Highly similar to geminin [H.sapie 10.2

132914 AA496037 Hs.60293 ESTs 47

132918 AA252605 Hs.6051 KIAA0616 protein 7.1

132936 AB002305 Hs.6111 KIAA0307 gene product 8.3

132951 U04209 Hs.61418 microfibrillar-associated protein 1 4.3

132957 AA234791 Hs.61469 Human gene from PAC 753P9; chromoso 13.2 
132959 AA028103 Hs.61472 ESTs; Weakly similar to unknown [S.cere 18.9

132968 N77151 Hs.61638 myosin X 5.8

132984 H80409 Hs.62112 zinc finger protein 207 4.3

132990 AA458761 Hs.18387 transcription factor AP-2 alpha (activating 4.2

132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose 2^4

132998 Y00062 Hs.170121 protein tyrosine phosphatase; receptor typ 4.4

133002 AF006082 Hs.42915 ARP2 (actin-related protein 2; yeast) hom 4.7

133005 C21400 Hs.278605 KIAA0970 protein 8.6

133015 AA047036 Hs.246315 ESTs 7.9 

133016 W81298 Hs.6289 growth factor receptor-bound protein 2 5.2

133039 X52055 Hs.63489 protein tyrosine phosphatase; non-recepto 4

133050 S67325 Hs.63788 propionyl Coenzyme A carboxylase; beta 5.2

133056 AA0713d7 Hs.6396 jumping translocation breakpoint 5

133062 R33663 Hs.64056 ESTs 5.4

133083 N70633 Hs.6456 chaperonin containing TCP1; subunit 2 (b 6

133091 AA122147 Hs.64691 KIAA0483 protein 5 

133093 AA598749 Hs.285996 ESTs 5.6 

133124 AA156049 Hs.267923 ESTs 4.1 

133126 D16469 Hs.6551 ATPase; H+ transporting; lysosomal (vacu 6.2 

133196 R37367 Hs.6727 Ras-GTPase activating protein SH3 doma 5.1

133214 Y10659 Hs.285115 interleukin 13 receptor; alpha 1 6.2

133225 Z41415 Hs.6823 ESTs; Weakly similar to Intrinsic factor-B 8.3

133228 N90029 Hs.6831 Homo sapiens clone 1400 unknown prote 4.7

133239 AA059405 Hs.179882 Homo sapiens clone 24655 mRNA sequen 5.5

133240 D31161 Hs.242894 ESTs 9

133257 AF006086 Hs.6895 actin related protein 2/3 complex; subunit 7.7

133264 W72187 Hs.69192 ESTs; Weakly similar to cDNA EST yk37 6.7 

133274 AA488886 Hs.6949 ESTs 4.2 

133281 AA421079 Hs.69594 ESTs; Weakly similar to Sox-like transcri 4.9 

133283 AA410507 Hs.6968 ESTs 4.3

133287 L15702 Hs.69771 B-factor; properdin 9.3

133294 R79723 Hs.6g997 zinc finger protein 238 30.4

133297 AA600057 Hs.70266 KIAA0905 protein 10.4

133318 AA256168 Hs.152316 ESTs 8.5

133362 H06195 Hs.7194 ESTs; Highly similar to CGI-59 protein [H 14

133370 AA156897 Hs.72157 DKFZP564I1922 protein 5

133391 X57579 Hs.727 inhibin; beta A (activin A; activin AB alp 13.9

133395 AA491296 Hs.72805 ESTs 4.3

133422 N79516 Hs.73287 ESTs; Weakly similar to eyelid [D.melano 4.5 

40 133431 AA255438 Hs.7358 Homo sapiens mRNA; cDNA DKFZp566 8 

133435 T23983 Hs.323966 ESTs 5 

133449 AA094989 Hs.7381 voltage-dependent anion channel 3 8.7

133468 X03068 Hs.73931 major histocompatibility complex; class II 5

133484 X78710 Hs.211581 metal-regulatory transcription factor 1 5.3

133506 AA316868 Hs.74346 ESTs; Weakly similar to 140G 11. h[D.me 6.8

133517 X52947 Hs.74471 gap junction protein; alpha 1; 43kD (conn 5.7

133551 D63480 Hs.278634 KIAA0146 protein 4.8

133569 AA313977 Hs.172772 transcription elongation factor B (SIII); po 9.5

133572 W94333 Hs.279915 translocase of inner mitochondrial membr 5

133577 F03717 Hs.75063 human immunodeficiency virus type I enh 7.4

133589 L37368 Hs.75104 RNA-binding protein S1; serine-rich dom 5

133608 D13315 Hs.75207 glyoxalase I 4.2

133617 AA148318 Hs.75249 KIAA0069 protein 4.5 

133627 U09587 Hs.75280 glycyl-tRNA synthetase 10 

133633 D21262 Hs.75337 nucleolar phosphoprotein p130 4.5

133634 U24166 Hs.234279 microtubule-associated protein; RP/EB fa 15.2

133640 D83004 Hs.75355 ubiquitin-conjugating enzyme E2N (homo 9.1

133644 D89077 Hs.75367 Src-like-adapter 6.4

133649 AA479139 Hs.75393 acid phosphatase 1; soluble 4.8

133652 AA287383 Hs.7540 ESTs 4.2

133674 AA458946 Hs.75497 ESTs 4.3

133700 K01396 Hs.297681 protease inhibitor 1 (anti-elastase); alpha- 8.3

133705 N21648 Hs.75659 MpV17 transgene; murine homolog; glom 4.6

133716 Y00282 Hs.75722 ribophorin II 7.5

133720 L27841 Hs.75737 pericentriolar material 1 9.4

133752 U49278 Hs.75875 ubiquitin-conjugating enzyme E2 variant 4.5

133765 D21255 Hs.75929 cadherin 11 (OB-cadherin; osteoblast) 6.4 
133772 W73693 Hs.76038 isopentenyl-diphosphate delta isomerase

133774 Z23090 Hs.76067 heat shock 27kD protein 1

133776 J03473 Hs.177766 ADP-ribosyltransferase (NAD+; poly (AD

133784 AA214305 Hs.301064 ESTs

133814 M33882 Hs.76391 myxovirus (influenza) resistance 1; homol

133829 AA453783 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564

133834 AA147510 Hs.288660 serine protease; umbilical endothelium

133839 M59815 Hs.170250 complement component 4A

133842 U73477 Hs.285013 putative human HLA class II associated p

133845 T68510 Hs.76704 ESTs

133859 U88782 Hs.178761 26S proteasome-associated padi homolog 

133867 D43948 Hs.76989 KtAA00g7 gene prodxt 

133868 U580g0 Hs.183874 cullin4A 
133871 AA454597 Hs.182793 ESTs 

15 133893 XOIOeO Hs.77356 transferrin receptor (p90;CD71) 

133914 N32811 Hs.77542 ESTs 

133918 W72783 Hs.58382 ESTs; Wealdy similar to CI 3F1 0.5 [Cele 

133944 AA045870 Hs.7780 Homo sapiens mRNA; cDNA DKFZp564 

133946 AA156565 Hs.173878 4-nitrophenylphosphatase domain and non 

20 133963 L34587 Hs.184693 transcription elongation factor B (Sill); po 

133980 D00760 Hs.250811 proleasome (prosome; macropain) subunit 

133990 C02374 Hs7822 Homo sapiens mRNA; cDNA DKFZp564 

133999 M28213 HsJ8305 RAB2; member RAS oncogene family 

134030 J03077 HsJ8575 prosaposin (variant Gaucher disease and V 

25 134032 Z81326 Hs.78589 protease inhibitor 12 (neuroserpin) 

134045 S82470 HsJ8768 BB1 

134046 D28473 Hs.172801 isoleudne-tRNA synthetase 
134064 D87685 Hs.78893 KtAA0244 protein 
134070 H96621 Hs78946 cullin3 

134087 U51166 Hs.173824 thymine-DNA glycosylase
134090 M22382 Hs.79037 heat shock 60kD protein 1 (chaperonin)
134098 X06323 Hs.79086 ribosomal protein; mitochondrial; L3
134110 U41060 Hs.79136 LIV-1 protein; estrogen regulated
134132 U32519 Hs.220689 Ras-GTPase-activating protein SH3-doma
134168 AA398908 Hs.181634 Human Chromosome 16 BAC clone CIT9
134170 M63138 Hs.79572 cathepsin D (lysosomal aspartyl protease)
134208 U88871 Hs.79993 peroxisomal biogenesis factor 7
134258 L28010 Hs.808 heterogeneous nuclear ribonucleoprotein F
134288 AA430008 Hs.8117 ESTs
134310 AA313414 Hs.8148 Homo sapiens clone 24856 mRNA sequen
134326 U16306 Hs.81800 chondroitin sulfate proteoglycan 2 (versic
134329 D38551 Hs.81848 RAD21 (S. pombe) homolog
134331 AA452020 Hs.111222 ESTs; Weakly similar to CGM28 protein
134351 R82074 Hs.82109 syndecan 1
134357 L43575 Hs.82171 Human clone 191B7 placenta expressed m
134363 M37033 Hs.82212 CD53 antigen
134367 X54199 Hs.82285 phosphoribosylglycinamide formyltransfe
134374 D62533 Hs.8236 ESTs
134375 AA412720 Hs.82389 ESTs; Highly similar to CGI-118 protein
134376 X02874 Hs.82396 2';5'-oligoadenylate synthetase 1
134381 U56637 Hs.184270 capping protein (actin filament) muscle Z-
134388 M15841 Hs.82575 small nuclear ribonucleoprotein polypepti
134395 L09717 Hs.8262 lysosomal-associated membrane protein 2
134399 H99801 Hs.82689 tumor rejection antigen (gp96) 1
134401 AA243746 Hs.211577 kinectin 1 (kinesin receptor)
134405 J04177 Hs.82772 collagen; type XI; alpha 1
134415 AA329274 Hs.82911 protein tyrosine phosphatase type IVA; m
134417 D87969 Hs.82921 solute carrier family 35 (CMP-sialic acid t
134419 L08044 Hs.82961 trefoil factor 3 (intestinal)
134421 AA122386 Hs.82985 collagen; type V; alpha 2
134423 WgeiSI Hs.83006 ESTs; Highly similar to CGI-139 protein
134438 AA449984 Hs.246857 ESTs; Highly similar to protein kinase JN
134446 T25732 Hs.83419 KIAA0252 protein
134453 X70583 Hs.83484 SRY (sex determining region Y)-box 4
134470 X54942 Hs.83758 CDC28 protein kinase 2
134487 R38165 Hs.83954 Homo sapiens unknown mRNA
134495 D63477 Hs.84087 KIAA0143 protein
134498 M63180 Hs.84131 threonyl-tRNA synthetase 6.1

134506 U45328 Hs.84285 ubiquitin-conjugating enzyme E2I (homol 4.6
134529 H24460 Hs.848 FK506-binding protein 4 (59kD) 6.2
134570 U66615 Hs.172280 SWI/SNF related; matrix associated; actin 4.8
134582 AA234986 Hs.86041 CGG triplet repeat binding protein 1 4.7
134600 R68884 Hs.86347 ESTs; Weakly similar to predicted using G 5.8
134623 X74496 Hs.86978 prolyl endopeptidase 4.5
134654 W23625 Hs.8739 ESTs; Weakly similar to ORF YGR200c [ 137
134655 AA454070 Hs.123090 ESTs 5.8
134675 AA250745 Hs.87773 protein kinase; cAMP-dependent; catalyti 8.9
134711 X04011 Hs.88974 cytochrome b-245; beta polypeptide (chro 6.8
134714 U89922 Hs.890 lymphotoxin beta (TNF superfamily; mem 357
134722 W47183 Hs.2d4226 ESTs; Weakly similar to neural F box pro 8.1
134776 J05582 Hs.89603 mucin 1; transmembrane 6.2
134806 Z49099 Hs.89718 spermine synthase 4.2
134810 M27394 Hs.89751 membrane-spanning 4-domains; subfamily 7
134840 U51477 Hs.89981 diacylglycerol kinase; zeta (104kD) 4.1
134843 H60595 Hs.90061 progesterone binding protein 47
134853 D82348 Hs.90280 5-aminoimidazole-4-carboxamide ribonuc 10.2
134866 U84011 Hs.904 amylo-1;6-glucosidase; 4-alpha-glucanotr 12.1
134868 Z39762 Hs.90419 KIAA0882 protein 6
134885 N27670 Hs.9071 progesterone membrane binding protein 5
134982 N46086 Hs.92308 ESTs 4.1
134989 AA236324 Hs.92381 Homo sapiens mRNA; chromosome 1 spe 16.8
134992 H05625 Hs.5831 ESTs 4
134993 AA282343 Hs.301005 purine-rich element binding protein B 4.4
135010 D59675 Hs.92927 ESTs 7
135015 U54999 Hs.278338 LGN protein 4.8
135029 AA224180 ESTs; Moderately similar to 17-beta-hydr 13.6
135032 AA243497 Hs.173685 Human DNA sequence from clone 30M3 4
135037 U77948 Hs.278589 general transcription factor II; i 8
135059 M598449 Hs.93832 Homo sapiens clone 24483 unknown mRN 5.4
135071 L08069 Hs.94 heat shock protein; DNAJ-like 2 9.3
135083 AA495950 Hs.94262 ESTs 6.7
135117 W52493 Hs.94694 Homo sapiens clone 24837 mRNA sequen 10.2
135144 AA044842 Hs.95260 Homo sapiens mRNA; cDNA DKFZp586 6.6
135154 AA126433 Hs.267812 sorting nexin 4 7.4
135218 D31157 Hs.324277 ESTs; Weakly similar to growth factor-res 6.2
135237 AA454930 Hs.9691 ESTs 19.5
135243 AA215333 Hs.97101 putative G protein-coupled receptor 8.8
135335 H20989 Hs.198281 pyruvate kinase; muscle 12.4
135349 D83174 Hs.9930 collagen-binding protein 2 (colligin 2) 5.5
135367 AA480109 Hs.9963 TYRO protein tyrosine kinase binding pro 5.4
135389 U05237 Hs.99872 fetal Alzheimer antigen 7.8
135400 M23263 Hs.99915 androgen receptor (dihydrotestosterone re 9.1
135411 L10333 Hs.99947 reticulon 1 5.3
300019 M97935 AFFX control: STAT1 8.3
300021 M97935 AFFX control: STAT1 7
300022 M97935 AFFX control: STAT1 14
300089 AI199738 Hs.208275 ESTs; Weakly similar to !!!! ALU CLASS 9.1
300107 AI694585 Hs.270464 ESTs; Weakly similar to !!!! ALU CLASS 7.4
300254 AW079607 Hs.188417 ESTs; Weakly similar to ZnT-3 [H.sapien 30.1
300328 AW015860 Hs.224623 ESTs 11.9
300549 AA699328 Hs.298119 ESTs 5.5
300711 AI492179 Hs.166244 ESTs; Weakly similar to cDNA EST yk40 11
300921 AW293224 Hs.232165 ESTs 11
301124 T79326 Hs.298262 ESTs; Weakly similar to dJ88J8.1 [H.sapi 8.8
301165 N85789 Hs.150186 ESTs; Weakly similar to PTER1N-4VVLP 6
301576 AI682905 Hs.270431 ESTs; Weakly similar to !!!! ALU SUBFA 4.7
301604 AA373124 Hs.24809 ESTs; Weakly similar to C17G10.1 [Cele 8
301704 AA526313 Hs.293691 ESTs 4.2
301782 N99399 Hs.143048 EST cluster (not in UniGene) with exon h 18
301884 AA312082 Hs.105445 GDNF family receptor alpha 1 20.7
301936 NM_004694 Hs.114924 EST cluster (not in UniGene) with exon h 11.6
302002 AF013956 Hs.5637 chromobox homolog 4 (Drosophila Pc cla 9.2
302032 NM_001992 Hs.128087 EST cluster (not in UniGene) with exon h 4.3
302067 H05698 Hs.222399 ESTs; Weakly similar to protein-tyrosine 7.8
302145 NM_003613 Hs.151407 EST cluster (not in UniGene) with exon h 15.1

302236 AI128606 Hs.6557 zinc finger protein 161 25.8
302276 NM_004448 Hs.323910 EST cluster (not in UniGene) with exon h 21.6
302290 AL117607 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564 41.4
302326 NM_004271 Hs.184018 EST cluster (not in UniGene) with exon h 8^
302342 AB023141 Hs.190386 KIAA0924 protein 5.4
302372 AL117406 Hs.200102 Homo sapiens mRNA; cDNA DKFZp434 8.9
302422 AB021227 Hs.3743 matrix metalloproteinase 24 (membrane-in 5.2
302431 AF129530 Hs.226434 EST cluster (not in UniGene) with exon h 5.3
302501 AF022726 Hs.251446 EST cluster (not in UniGene) with exon h 9.9
302505 AL049650 Hs.247874 multiple UniGene matches 4.3
302533 L36149 Hs.248116 chemokine (C motif) XC receptor 1 4.9
302638 AA463798 Hs.102696 ESTs; Weakly similar to C11D2.4 [Celeg 5.3
302656 AW293005 Hs.70704 ESTs 8.4
302792 AA343696 Hs.46821 ESTs; Weakly similar to putative (H.sapie 4.5
302820 X04588 Hs.85844 EST cluster (not in UniGene) with exon h 6.8
302838 U66049 Hs.82171 EST cluster (not in UniGene) with exon h 8.4
302892 N58545 Hs.42346 histone deacetylase 3 22.8
302977 AW263124 Hs.315111 EST cluster (not in UniGene) with exon h 6.8
302989 N46406 Hs.84700 EST cluster (not in UniGene) with exon h 8.9
303007 AA478876 Hs.317714 pallid (mouse) homolog; pallidin 10.1
303052 AF140242 Hs.279926 EST cluster (not in UniGene) with exon h 24.4
303131 AW081061 Hs.103180 actin-like 6 6.3
303132 AI929819 Hs.4055 ESTs 17.7
303153 U09759 Hs.246857 mitogen-activated protein kinase 9 11.4
303387 AA908797 Hs.180799 ESTs 15.8
303499 AI815990 Hs.293515 ESTs 7.2
303502 AA488528 EST cluster (not in UniGene) with exon h 5.3
303576 T07216 Hs.301226 EST cluster (not in UniGene) with exon h 16.2
303620 AA397545 Hs.119151 ESTs 8.9
303634 AI953377 Hs.28444 ESTs; Weakly similar to predicted using G 12
303642 AW299459 Hs.111977 EST cluster (not in UniGene) with exon h 4.2
303654 AA436942 Hs.288529 ESTs 8.4
303733 AW502498 Hs.15220 ESTs; Weakly similar to zinc finger prote 5.2
303780 AI424014 Hs.18995 ESTs; Moderately similar to KIAA0456 p 28.4
303792 C75094 Hs.199839 ESTs; Highly similar to NG22 [H.sapiens 4.4
303842 AI337304 Hs.126268 ESTs; Weakly similar to similar to PDZ d 8.1
303951 AW475081 Hs.172928 collagen; type I; alpha 1 7.5

304465 AA421948 EST singleton (not in UniGene) with exon 6.5
304507 AA456426 EST 5.4
304591 AA505702 EST singleton (not in UniGene) with exon 9.8
304601 AA507875 EST singleton (not in UniGene) with exon 7.5
304659 AA533185 EST singleton (not in UniGene) with exon 7
305040 AA630582 Hs.169476 glyceraldehyde-3-phosphate dehydrogena 12.4
305134 AA653159 Hs.179661 EST singleton (not in UniGene) with exon 8.7
305415 AA725116 Hs.78465 EST singleton (not in UniGene) with exon 5.3
305453 AA738110 EST singleton (not in UniGene) with exon 4.1
305898 AA872838 keratin 8 7.7
305913 AA876109 EST singleton (not in UniGene) with exon 6.3
305950 AA884479 EST singleton (not in UniGene) with exon 5.6
306004 AA889992 Hs.2186 EST singleton (not in UniGene) with exon 13.2
306009 AA894560 Hs.283370 EST singleton (not in UniGene) with exon 4.4
306060 AA906161 Hs.76277 EST singleton (not in UniGene) with exon 4.6
306398 AA970548 Hs.297681 EST singleton (not in UniGene) with exon 7.6
306505 AA987722 Hs.172928 EST singleton (not in UniGene) with exon 19.7
306576 AA995761 Hs.276092 EST singleton (not in UniGene) with exon 5.5
307117 AI184111 Hs.76067 heat shock 27kD protein 1 7.7
307138 AI185516 Hs.172928 collagen; type I; alpha 1 8.8
307187 AI190870 Hs.276417 EST singleton (not in UniGene) with exon 4.1
307542 AI280859 Hs.62954 EST singleton (not in UniGene) with exon 6
307554 AI281603 Hs.172928 EST singleton (not in UniGene) with exon 10.8
307806 AI351739 Hs.276726 EST singleton (not in UniGene) with exon 4.7
308079 AI472733 Hs.270208 ESTs 4.2
308307 AI581398 Hs.172928 collagen; type I; alpha 1 5.4
308511 AI587580 Hs.169476 EST singleton (not in UniGene) with exon 10.1
308615 AI738593 Hs.101774 EST singleton (not in UniGene) with exon 15.1
308677 AI761173 EST singleton (not in UniGene) with exon 4.6
308852 AI829848 Hs.182937 peptidylprolyl isomerase A (cyclophilin A 5.9

308974 AI872290 Hs.300697 immunoglobulin gamma 3 (Gm marker) 4.5
308981 AI873242 EST singleton (not in UniGene) with exon 7.6
308995 AI880172 EST singleton (not in UniGene) with exon 6.6
309177 AI951118 EST singleton (not in UniGene) with exon 24.3
309186 AI952723 Hs.90207 EST singleton (not in UniGene) with exon 6.1
309198 AI955915 major histocompatibility complex; class I; 5.6
309226 AI969897 EST singleton (not in UniGene) with exon 6.2
309279 AI990102 EST singleton (not in UniGene) with exon 7.9
309583 AW170035 EST 64.5
309624 AW191929 Hs.252989 EST 5.3
309629 AW192764 Hs.172928 collagen; type I; alpha 1 6.9
309641 AW194230 Hs.253100 EST 11.4
309698 AW238461 Hs.73742 ribosomal protein; large; P0 4^
309700 AW241170 Hs.179661 Homo sapiens clone 24703 beta-tubulin m 11.9

310073 AI335004 Hs.148558 ESTs 4.2 

310094 AW450967 Hs.235240 ESTs 5.7 

310373 AW080778 Hs.145582 ESTs 4.8
310438 AW022192 Hs.200197 ESTs 39.1
310470 AI281848 Hs.194691 ESTs 4.9
310583 AW205632 Hs.211198 ESTs 7
310877 T47784 Hs.188955 ESTs 4.1
311067 AI587332 Hs.209115 ESTs 11.2
311166 AI821294 Hs.118599 ESTs 24.1
311199 T57896 Hs.191095 EST cluster (not in UniGene) 5.7
311465 AI758660 Hs.206132 ESTs 15.7
311587 AI828254 Hs.271019 ESTs 6.4
311774 AA700870 Hs.14304 ESTs 6.2
311785 AI056769 Hs.133512 ESTs 5
311923 T60843 Hs.189679 ESTs 55
311935 AA216387 EST cluster (not in UniGene) 5.5

311972 N51511 Hs.188449 ESTs 5.2 

312014 AI435650 Hs.128778 ESTs 43 

312047 AA588275 Hs.180669 ESTs 14.7 

35 312147 T89855 Hs.195648 EST cluster (not in UniGene) 9.8 

312153 AA759250 Hs.153028 cytochrome b-561 27.1 

312158 T92251 Hs.198882 ESTs 4.2 

312172 AI222168 Hs.191168 ESTs 6.1
312226 AI796815 Hs.199993 ESTs; Weakly similar to Ubiquitous TPR 5.5
312292 AW451893 Hs.151124 ESTs 18.4
312312 AI080505 Hs.134529 ESTs 11.9
312369 AA582039 Hs.173884 Homo sapiens mRNA; chromosome 1 spe 4
312407 R46180 Hs.153485 ESTs 13.6
312430 AW139117 Hs.117494 ESTs 4.1
312470 AW451347 Hs.175862 ESTs 4.6
312483 AI417526 Hs.7753 ESTs 15.3
312521 AA033609 Hs.319093 ESTs 125

312544 AI498371 Hs.183526 ESTs 14.6 

312638 AW439195 Hs.256880 ESTs 5.3 

50 312754 R99834 Hs.250383 ESTs 8.4 

312772 H63791 EST cluster (not in UniGene) 4.3

312821 AA699325 Hs.269880 ESTs 8.3 

312837 AW292286 Hs.255058 ESTs 7.1 

312849 AA846353 Hs.194054 ESTs 5.9 

312854 AA828713 Hs.321058 EST cluster (not in UniGene) 4.1

312992 AA088446 Hs.170298 ESTs 7.3 

313096 AI422367 Hs.163533 ESTs 6.1 

313112 AA732534 Hs.269099 ESTs 4.2 

313126 AA720887 Hs.283313 EST cluster (not in UniGene) 18.1
313136 N59284 Hs.288010 ESTs 17
313197 AI738851 Hs.222487 ESTs 12.9

313219 N74924 Hs.182099 ESTs 7.1 

313258 AW068358 Hs.183918 ESTs 137 

313328 AW449211 Hs.105445 ESTs 27.9 

65 313352 AW292127 Hs.144758 ESTs 9.8 

313417 AA741151 Hs.137323 ESTs 8.2 

313455 AW081702 Hs.98571 ESTs 6.9
313590 AA804410 Hs.291677 EST cluster (not in UniGene) 53
313663 AI953261 Hs.169813 ESTs 7.6
313667 U69201 Hs.13684 ESTs; Weakly similar to choline kinase B 12.5
313749 AW450376 Hs.119004 ESTs 5.5
313832 AW271022 Hs.133294 ESTs 4.3
313881 AA535580 Hs.16331 ESTs 7.7
313915 AB69390 Hs.163443 ESTs 27.1
313955 A)B588B4 Hs.270647 ESTs 5.7
313974 AI310151 Hs.173524 ESTs 43
314097 AA648744 Hs.269493 ESTs 14.5
314129 AA228366 Hs.115122 ESTs 9.5
314359 AA205569 Hs.194193 ESTs 5.4
314384 AA535840 Hs.162203 ESTs; Weakly similar to alternatively spli 5.3
314394 AI380563 Hs.130815 ESTs 13.2
314462 AA347951 Hs.326413 ESTs 6.2
314465 AA602917 Hs.15e974 ESTs 18.1
314470 AI934422 Hs.30661 ESTs 4.2
314488 AA358265 Hs.182890 ESTs 6.1
314506 AA833655 Hs.206858 ESTs 27.8
314510 AI204418 Hs.190080 ESTs 9.5
314558 AI873274 Hs.190721 ESTs 22.5
314661 AA436432 Hs.324239 EST cluster (not in UniGene) 13.3

314691 AW207206 Hs.136319 ESTs 21.4 

314754 AW026761 Hs.134374 ESTs 4.4 

314775 AI149880 Hs.188809 ESTs 4.4
314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 18.4
314961 AW008061 Hs.231994 ESTs 102
314963 AI689617 Hs.200934 ESTs 5.3

315006 AI538613 Hs.298241 ESTs 207 

30 315010 AA531082 Hs.240049 ESTs 5 

315019 AA532807 Hs.105822 ESTs 6.1 

315033 AM93046 Hs.146133 ESTs 12
315036 AA534953 Hs.163297 ESTs 8.3
315037 AW205863 Hs.133988 ESTs; Weakly similar to gene MAC25 pr 6.1
315051 AW292425 Hs.163484 EST 12.7
315054 AI968598 Hs.78768 ESTs 7.6
315073 AW452948 Hs.257631 ESTs 13.9
315080 AA744550 Hs.136345 ESTs 4.4
315083 AI221325 Hs.205442 ESTs 5.1
315088 AA557351 Hs.152448 ESTs; Moderately similar to MULTIFUN 4.7

315175 AI025842 Hs.152530 ESTs 11.9 

315196 AA972756 Hs.44898 ESTs 28.8 

315296 AA876905 Hs.125286 ESTs 16.1 

315303 AW194364 Hs.128022 ESTs; Weakly similar to FIG-1 PROTEIN 257 

315352 AA604799 Hs.136528 ESTs; Moderately similar to !!!! ALU SU 12.3
315364 AA643602 Hs.155485 ESTs; Highly similar to serine protease [H 4.6
315368 AW291563 Hs.104696 ESTs 4.8
315390 AI801565 Hs.200113 ESTs; Weakly similar to alternatively spli 4.4
315408 AW273261 Hs.216292 ESTs 5
315458 AA872000 Hs.116104 ESTs 7.6
315472 AA828850 Hs.165469 ESTs 4.9
315478 AA665612 Hs.120874 ESTs 5.2
315498 AA628539 Hs.116252 ESTs; Moderately similar to !!!! ALU SU 4.8
315527 AI791138 Hs.116768 ESTs 4.4
315530 AI200852 Hs.127780 ESTs 22.4
315562 AA737415 Hs.152826 ESTs 5.9
315634 AA837085 Hs.220585 ESTs 8.8
315647 AA648983 Hs.212911 ESTs 15
315652 AI521489 Hs.3053 ESTs 6.3
315676 AW002565 Hs.124660 ESTs 9.2
315680 AA814309 Hs.123583 ESTs 8.1
315735 AI831760 Hs.155111 ESTs 13.4
315741 AA812168 Hs.122559 ESTs 5.4
315769 AA744875 Hs.189413 ESTs 4.4
315978 AA830893 Hs.119769 ESTs 10.4

315984 AI015862 Hs.131793 ESTs 5 

316042 AW297979 Hs.170698 ESTs 147
316136 AA830808 Hs.124366 ESTs 4
316177 AI908272 Hs.293102 EST cluster (not in UniGene) 32.6
316313 AA741300 Hs.202599 ESTs 4.8
316405 AA757900 Hs.270823 ESTs 4.8
316480 AJ749921 Hs.205377 ESTs 1Z9
316564 AI743571 Hs.168799 ESTs; Weakly similar to !!!! ALU SUBFA 8.1

316714 AA809792 Hs.123307 ESTs 5 

316715 AI440266 Hs.170673 ESTs 4.2 
316828 AA828116 Hs.173076 ESTs 5.2 

10 316859 AI954880 Hs.134604 ESTs 13.3 

316905 AW13d241 Hs.210846 ESTs 6^ 

316943 AW014875 Hs.137007 ESTs 5.3 

316949 AA856749 Hs.124620 ESTs 7.2 

317008 AW051597 Hs.143707 ESTs 4.1 

15 317028 AA962623 Hs.189144 ESTs; Weakly similar to RENAL SODIU 4.2 

317057 AI805392 Hs.325335 ESTs 4.5
317069 AI732892 Hs.190489 ESTs 6.4
317210 AA490716 EST cluster (not in UniGene) 4.4
317298 AI922374 Hs.158549 ESTs 5.9

20 317658 AW139077 Hs.202217 ESTs 4.6 

317674 AW294909 Hs.132208 ESTs 5.2 

317685 AI798630 Hs.149997 ESTs 4.3 

317836 AA983913 Hs.128929 ESTs 12.4 

317881 AI827248 Hs.224398 ESTs 1i1 

25 317902 AI828602 Hs.211265 ESTs 8.8 

317916 AI565071 Hs.159983 ESTs 12.8 

318042 AW294522 Hs.14gg91 ESTs 5.6 

318053 AI074465 Hs.13346d ESTs 4 

318064 AW29688e Hs.170939 ESTs 5.2 

30 318070 AI024594 Hs.248942 ESTs 4.7 

318073 AW167087 Hs.131562 ESTs 15.7 

318146 AI040125 Hs.150521 ESTs 5.9
318186 AW016773 Hs.3709 ESTs 5.3
318481 AI291584 Hs.145921 ESTs; Weakly similar to HYPOTHETICA 7.6
318566 AI335361 Hs.226376 ESTs 5.8
318517 AW247252 Hs.75514 nucleoside phosphorylase 11.1
318662 AI285898 Hs.294014 ESTs 16.3
318691 AW192139 Hs.181307 H3 histone; family 3A 4
318740 NM_002543 Hs.77729 EST cluster (not in UniGene) 21.3

40 318744 AI793124 Hs.144479 ESTs 35 

318948 AA317274 Hs.13996 ESTs 11.7 

319163 F15257 Hs.27 glycine dehydrogenase (decarboxylating; 7
319478 R06841 Hs.270307 EST cluster (not in UniGene) 8.9
319545 R83716 Hs.14355 ESTs 8.2
319668 NM_002731 Hs.87773 EST cluster (not in UniGene) 25.4
319763 AA460775 Hs.6295 ESTs 7
319913 AA179304 Hs.271586 ESTs; Moderately similar to !!!! ALU SU 8.7
319935 W22152 Hs.282929 EST cluster (not in UniGene) 5.6
319951 AA307665 Hs.14559 ESTs 4.9
319962 H05350 Hs.135056 ESTs 9.2
319977 AA632632 EST cluster (not in UniGene) 4.6
320074 AA321166 Hs.278233 EST cluster (not in UniGene) 16.7
320092 AF022799 Hs.113292 calpain 9 (nCL-4) 5.4
320107 AA836461 Hs.291712 EST cluster (not in UniGene) 5.3
320133 D63271 EST cluster (not in UniGene) 5.5
320167 AA9d4373 Hs.90790 EST cluster (not in UniGene) 15
320187 T99949 Hs.303428 EST cluster (not in UniGene) 6.7
320211 AL039402 Hs.125783 DEME-6 protein 24.3
320401 U90449 Hs.152717 nucleoside diphosphate kinase type 6 (inh 10
320458 AI884396 Hs.24131 ESTs 5.4
320488 R31386 Hs.191791 EST cluster (not in UniGene) 4.9
320521 N31464 Hsi4743 ESTs 9.5
320661 AA864846 Hs.115175 EST cluster (not in UniGene) 6.6
320691 R61576 Hs.313951 hypothetical protein 5.9
320699 R63161 Hs.118249 EST cluster (not in UniGene) 4
320727 U96044 Hs.181125 EST cluster (not in UniGene) -15.3
320993 AL050145 Hs.225986 Homo sapiens mRNA; cDNA DKFZp586 7.2
321012 AA737314 Hs.194324 EST cluster (not in UniGene) 6.1

321050 AW393497 EST cluster (not in UniGene) 5
321051 AF134149 Hs.240395 EST cluster (not in UniGene) 11.4
321171 AI769410 Hs.221461 ESTs 7.7
321192 AA295304 Hs.297939 ESTs; Weakly similar to neogenin [H.sap 5.5
321354 AA078493 EST cluster (not in UniGene) 165
321387 H68014 Hs.141278 ESTs; Weakly similar to !!!! ALU SUBFA 4.2
321412 AW366305 Hs.22891 EST cluster (not in UniGene) 6.3
321489 AW392474 Hs.172759 ESTs; Moderately similar to !!!! ALU SU 9
321539 N98619 Hs.42915 ARP2 (actin-related protein 2; yeast) hom 11.3
321593 H84762 Hs.253197 ESTs 10.4
321666 D28390 Hs.272897 EST cluster (not in UniGene) 19.9
321891 AW157424 Hs.165954 ESTs 5.6
321910 H67065 Hs.271530 ESTs; Weakly similar to !!!! ALU SUBFA 5.4
321953 AW068268 Hs.292833 ESTs; Weakly similar to !!!! ALU CLASS 6.5
321978 N77342 Hs.21851 EST cluster (not in UniGene) 102
322017 AA310039 Hs.9192 ESTs 9.8
322026 AA233527 Hs.283675 low density lipoprotein receptor (familial 27 &
322035 AL137517 Hs.306201 EST cluster (not in UniGene) 40.2
322171 AF085968 Hs.48474 EST cluster (not in UniGene) 5.7
322175 AF085975 EST cluster (not in UniGene) 7.7
322236 AL134970 Hs.104222 follistatin-like 1 14.4
322303 W07459 Hs.157601 EST cluster (not in UniGene) 13.4
322735 AA086123 Hs.297856 EST cluster (not in UniGene) 7.6
322777 AA679082 Hs.269947 ESTs 4.4
322818 AW043782 Hs.293616 ESTs 21
322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2 15.3
322975 016391 EST cluster (not in UniGene) 21.3

322991 C18965 Hs.159473 ESTs 11.7
323011 AA580288 EST cluster (not in UniGene) 8.9
323091 AW014094 Hs.210761 ESTs 10.8
323107 AI301107 Hs.150790 ESTs 6.5
323136 AI120351 Hs.30177 EST cluster (not in UniGene) 5.5
323168 AL120862 Hs.124165 ESTs 17.9
323195 AI064982 Hs.117950 multifunctional polypeptide similar to SA 5.8
323201 AL049370 Hs.13350 Homo sapiens mRNA; cDNA DKFZp586 11.6
323203 AA203135 Hs.130186 ESTs 6.4
323243 W44372 Hs.110771 EST cluster (not in UniGene) 7.3
323244 T70731 Hs.193620 EST cluster (not in UniGene) 15.8
323328 AA228078 Hs.255096 EST cluster (not in UniGene) 4.8
323332 AI829520 Hs.227513 ESTs 20.2
323333 AA228883 Hs.208558 EST cluster (not in UniGene) 8.8
323570 AL038623 Hs.208752 ESTs; Weakly similar to !!!! ALU SUBFA 5
323604 AI751438 Hs.41271 ESTs; Weakly similar to !!!! ALU SUBFA 6.5
323685 AA344205 Hs.289088 EST cluster (not in UniGene) 7.1
323753 AA327102 Hs.70266 EST cluster (not in UniGene) 6.1
323817 AA410943 EST cluster (not in UniGene) 16.8
323845 AI684674 Hs.41127 ESTs; Weakly similar to wadaw [D.melan 10.1
323930 AA570698 Hs.8173 ESTs 6.4
323997 AA844907 Hs.274454 EST cluster (not in UniGene) 8
324047 AA378201 Hs.271340 EST cluster (not in UniGene) 6^
324261 AL044891 Hs.269350 EST cluster (not in UniGene) 50.1
324302 AA543008 Hs.292471 ESTs; Weakly similar to !!!! ALU SUBFA 5.7
324338 AL138357 Hs.145078 ESTs 9.5
324344 AW502000 Hs.46677 EST cluster (not in UniGene) 4.4
324432 AA464510 Hs.152812 EST cluster (not in UniGene) 16.7
324495 AW501411 Hs.122489 ESTs; Weakly similar to !!!! ALU CLASS 5.5
324497 AW152624 Hs.136340 ESTs 5.4
324598 AA502659 Hs.163986 ESTs 8.8
324603 AW016378 Hs.292934 ESTs 23.1
324620 AA446021 Hs.94109 EST cluster (not in UniGene) 21.2
324727 AI610425 Hs.ig597 ESTs 5
324774 AI031771 Hs.132586 ESTs 5
324783 AA640770 Hs.200994 EST cluster (not in UniGene) 4.1
324824 AI826999 Hs.224624 ESTs 6.3
324826 AA704806 Hs.143d42 ESTs 11.7

324902 D31323 Hs.271492 ESTs 4.8
324981 AA613792 EST cluster (not in UniGene)
324987 108682 Hs.172634 ESTs
324988 T06997 Hs.121028 EST cluster (not in UniGene)
325146 AI064690 Hs.171176 ESTs
325622 CH.14_hsgi|5867000
326213 CH.17_hsgi|5867224
326474 CH.19_hsgi|5867405
326816 CH.20_hsgi|6552458
326817 CH.20_hsgi|6552458
327110 CH.21_hsgi|6117842
327196 CH.01_hsgi|5867446
327283 CH.01_hsgi|5867478
327313 CH.01_hsgi|5867501
327450 CH.02_hsgi|5867766
328059 CH.06_hsgi|6117819
328304 CH.07_hsgi|6004478
328492 CH.07_hsgi|5868455
328857 CH.07_hsgi|6381927
329367 CH.)Lhsgi|5868842
329373 CH.X_hsgi|6682537
329655 CH.14_p2gi|6448515
329899 CH.15_p2gi|6563505
329960 CH.16_p2gi|5091594
330084 CH.19_p2gi|6015302
330384 M23263 androgen receptor (dihydrotestosterone re
330385 AA449749 ESTs; Highly similar to secreted apoptosi
330387 H14624 ESTs; Highly similar to secreted apoptosi
330388 X03363 HER2 receptor tyrosine kinase (c-erbB-2;
330409 D50692 Hs.78221 c-myc binding protein
330460 TIGR.HT544 Hs.73946
330486 M13755 Hs.833 interferon-stimulated protein; 15 kDa
330494 M29696 Hs.237868 interleukin 7 receptor
330500 M34423 Hs.79222 galactosidase; beta 1
330510 M75099 Hs.227729 FK506-binding protein 2 (13kD)
330513 M81057 Hs.180884 carboxypeptidase B1 (tissue)
330541 U22970 Hs.265827 multiple UniGene matches
330542 U23942 Hs.226213 cytochrome P450; 51 (lanosterol 14-alpha
330547 U32989 Hs.183671 tryptophan 2;3-dioxygenase
330551 U39840 Hs.299867 hepatocyte nuclear factor 3; alpha
330562 U49082 Hs.76460 transporter protein
330573 U62800 Hs.83393 cystatin E/M
330673 D57823 Hs.321403 Sec23 (S. cerevisiae) homolog A
330711 AA164687 Hs.177576 mannosyl (alpha-1;3-)-glycoprotein beta-1
330814 AA015730 Hs.265398 ESTs; Weakly similar to transformation-r
330850 AA075298 Hs.322710 ESTs
330874 AA127474 Hs.191157 ESTs; Weakly similar to !!!! ALU SUBFA
330884 AA133457 Hs.102548 ESTs
330912 AA195936 Hs.82719 general transcription factor IIA; 1 (37kD a
330924 AA232136 Hs.159737 Homo sapiens mRNA; cDNA DKFZp434
330997 H55762 Hs.9302 ESTs
331014 H98597 Hs.30340 ESTs
331024 N32919 Hs.27931 ESTs
331046 N56553 Hs.191358 ESTs
331135 R61398 Hs.4197 ESTs
331145 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME
331148 R73816 Hs.17385 ESTs
331222 T98531 Hs.173904 ESTs
331230 W69807 Hs.16537 hypothetical protein; similar to (U0e944)
331306 AA252079 Hs.63931 dachshund (Drosophila) homolog
331327 AA281076 Hs.109221 ESTs
331337 AA287662 Hs.50495 ESTs
331341 AA303125 Hs.23240 ESTs; Weakly similar to !!!! ALU SUBFA
331344 AA357927 Hs.126550 ESTs
331362 AA417956 Hs.40782 ESTs
331353 AA421562 Hs.91011 anterior gradient 2 (Xenopus laevis) homo
331376 AA443802 Hs.41007 ESTs; Weakly similar to cDNA EST yk47
331384 AA456001 Hs.93847 ESTs
331478 N26608 Hs.40639 ESTs 7

331526 N49967 Hs.46624 ESTs 19.8 

331533 N51517 Hs.47282 ESTs 6.5 

331681 W85712 Hs.119571 collagen; type III; alpha 1 (Ehlers-Danlos 13.8
331686 W88502 Hs.182258 ESTs 9.9
331750 AA284372 Hs.111471 ESTs 5.6
331751 AA284840 Hs.143818 ESTs 5.8
331760 AA292721 Hs.154434 ESTs; Weakly similar to unknown [H.sap 7.4
331763 AA312861 Hs.96704 ESTs 7.8
331825 AA411144 Hs.292882 ESTs 15.2
331890 AA432166 Hs.3577 succinate dehydrogenase complex; subuni 24.3
331952 AA454756 Hs.97837 ESTs 5
332015 AA487910 Hs.208800 ESTs; Weakly similar to !!!! ALU CLASS 10.5
332043 AA490831 Hs.125056 ESTs 11.4
332060 AA504779 Hs.191402 ESTs 13.6
332071 AA598594 Hs.205293 ESTs 9.1
332093 AA608794 Hs.112592 ESTs 8.8
332139 AA6206e9 Hs.112879 EST 9
332219 N22508 Hs.139315 ESTs 7.1
332225 N33213 Hs.100425 ESTs 12.2
332246 N57927 Hs.120777 ESTs; Weakly similar to RNA POLYME 15.6
332247 N58172 Hs.109370 ESTs 16.9
332260 N70088 Hs.138467 ESTs 4
332269 N91279 Hs.109654 ESTs; Moderately similar to outer membr 8.2
332336 T96130 Hs.137551 ESTs 77
332340 W15495 Hs.129781 chromosome 21 open reading frame 5 14.1
332347 W60326 Hs.288684 ESTs 4.4
332362 W93640 Hs.4779 ESTs; Moderately similar to similar to AD 165
332467 AA489630 Hs.119004 KIAA0665 gene product 4.8
332499 M12036 Hs.323910 Human tyrosine kinase-type receptor (HE 10.4
332513 AA018162 Hs.154424 deiodinase; iodothyronine; type II 5.8
332526 AA281753 Hs.77515 inositol 1;4;5-triphosphate receptor type 19
332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHAN 15.3
332565 AA234896 Hs.25272 E1A binding protein p300 12.3
332607 R41791 Hs.36566 LIM domain kinase 1 11.1
332640 AA417152 Hs.5101 protein regulator of cytokinesis 1 18.2
332694 AA262768 Hs.243901 KIAA1067 protein 15.2
332702 H93968 Hs.75725 transgelin 2 4.7
332705 T59161 Hs.76293 thymosin; beta 10 55
332749 AA479968 Hs.88251 arylsulfatase A 9.8

332927 CH22 FGENES.38J 177 

332929 CH22IFGENES38J 4J 

332930 CH22_FGENES.38_4 7.4
332955 CH2?.FGEI^S.4aLl2 5.4 

45 332958 CH2a/GENES.48J5 17.8 

332961 CH2Z.FGENES.48J8 10.6 

332983 CH2^FGENES.54.5 4.3 

333009 CH22^FGENES.61J 5.2 

333010 CH22J^GENES.6L2 8.1 
333013 CH22_FGENES.61_5 8.5

333108 CH22LFGENES.79J4 5.6 

333139 CH22_FGENES.83J6 6.3 

333254 CH2Z,FGENES.118_2 6.8 

333305 CH2i.FGENES.137^2 11.4 

55 333343 CH2a_FGENES.139j2 5.1 

333388 'CH2^,FGENES.144J 127 

333456 CH22_FGENES.157_5 4.2 

333459 CH22J'6ENES.157J 7.6 

333517 CH22_FGENES.173^ 8.2 

60 333585 CH22-FGENES.203J 5 

333679 CH22-FGENES.247J 4.3 

333743 CH22-FGE^€S.264J 13.4 

333758 CH22J^GENES.268J 4 

333767 CH22_F6ENES.27L6 5.6 

65 333768 CH22J'GENES.27L7 12.2 

333769 CH22_FGENES.27U 483 

333795 CH22.FGENES.275J 6.1 
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333798 CH22_reENES.275J 6.8 

333892 CH2^GENES.292J4 4.4 

333904 CH22 FGENES.294^ 6.5 

333905 CH2^FGENES.294J 9.3 
333921 CH22_FGENES.296_12 9.6

333968 CH2aj=GENES.307J 15.9 

334102 CH22-FGENES.327J0 7.1 

334222 CH2^FGENES.360„3 6.7 

334223 CH22_FGENES.360J 33.5 
10 334264 CH2^.FGENES.367J5 185 

334343 CH22.FGENES.375.25 6.1 

334360 CH22J=GENES.378J5 6.1 

334784 CH2?J=GENES.4323 4.8 

334789 CH2a/GENES.432J4 5.1 

15 334794 CH2^FGENES.434^ 7 

334889 CH2^GENES.452^3 12.4 

335004 CH2^FGENES.47a.8 7.9 

335115 CH22_FGENES.496^ 18.8 

335287 CH2a.FGENES.526 J 1 4.5 

20 335342 CH2aj=GENES.536J 5.3 

335491 CH22_FGENES.570^ 24 

335495 CH22_FGENES.570_28 7

335498 CH22_FGENES.571_7 12.2

335544 CH22_FGENES.576_5 8.4

335610 CH22_FGENES.583_4 12.9

335653 CH22[FGENES.590jt 6.7 

335682 CH2?.FGENES.595^ 12.1 

335687 CH22J=GENES.596_2 13.9 

335755 CH2^FGENES.604_4 115 

30 335782 CH2aj6ENES.609J 175 

335791 CH2i.F6ENES,611_7 27.3 

335809 CH2^FGENES.617^6 195 

335822 CH22_FGENES.619_7 19.1

335823 CH22_FGENES.619_8 4.5
35 335824 CH2^FGENES.619J1 A02 

335825 CH22^FGENES.519J2 343 

335895 CH2ZJGENES.535„3 10.2 

335917 CH2a-FGENES.636J3 6 

335920 CH22_FG£NES.636J6 8.8 

40 336035 CH22J=GENES.678,6 5.9 

. 336042 CH2?.FGENES.679_4 5.8 

336093 CH2a.FGENES.691_2 11.6 

336096 CH22J^ENES.691j5 7.6 

336150 CH22_F6ENES.706.6 6.3 

45 336152 CH2^FGENES706,9 10.5 

336416 CH22»FGENES.823^38 5 

336444 CH22«FGENES.827J0 4,8 

336449 CH22_FGENES.829J 13,6 

336471 CH22 FGENES.829_30 6.9 

336512 CH22_FGENES.834_7 21.4

336558 CH22_FGENES.842_3 8.2

336560 CH22_FGENES.B42J 9 

336676 CH22_FGENES.43-4 9.4

336959 CH22_FGENES.367-13 19 

337968 CH22_EM:AC005500.GENSCAN.103-2 13.4
338008 CH22_EM:AC005500.GENSCAN.127-9 15.2
338057 CH22_EM:AC005500.GENSCAN.160-1 13.9
338410 CH22_EM:AC005500.GENSCAN.341-6 8
338451 CH22_EM:AC005500.GENSCAN.359-3 11.6
338588 CH22_EM:AC005500.GENSCAN.432-1 10.3
338665 CH22_EM:AC005500.GENSCAN.464-2 4.8
338689 CH22_EM:AC005500.GENSCAN.475-3 6.7
338832 CH22_DJ246D7.GENSCAN.6-9 4.8
338980 CH22_DA59H18.GENSCAN.2-4 5.1
339352 CH22_BA354I12.GENSCAN.59-7 6.9
339373 CH22_BA232E17.GENSCAN.1-29 4.3

TABLE 13A

Table 13A shows the accession numbers for those pkeys lacking unigene ID's for Table 13.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
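The row layout described by this legend can be illustrated with a minimal Python sketch. It assumes each row is whitespace-separated, with the Pkey first, the CAT number second (written without internal spaces, e.g. `371681_1`), and any remaining fields being Genbank accession numbers. The names `ClusterRow` and `parse_cluster_row` are hypothetical and are not part of the original disclosure.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    pkey: int              # Unique Eos probeset identifier number
    cat_number: str        # Gene cluster number
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split one cleaned-up table row into its three columns.

    Assumption: fields are separated by whitespace and the CAT number
    contains no spaces.
    """
    fields = line.split()
    if len(fields) < 2:
        raise ValueError(f"expected at least Pkey and CAT number: {line!r}")
    pkey, cat_number, *accessions = fields
    return ClusterRow(int(pkey), cat_number, accessions)


row = parse_cluster_row("123619 371681_1 AA602964 AA609200")
print(row.pkey, row.cat_number, row.accessions)
```

Rows whose accession list wraps across several lines would need to be joined before parsing; that step is omitted here.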

Pkey CAT number Accession



123619 371681_1 AA602964 AA609200
103207 30635_4 X72790
103349 1105^-2 X89059

1 108S5 19346.14 AA992380 N33063 N21418 H79958 R2191 1 H79957 
113248 328626 J T63857AW971220AA493469 763699 

123169 44573 2 AI950087 N70208 R97040 N36809AI308119AVV967677 N35320AI251473 H59397AW971573 R97278 W01059 
25 " AA908598 AA251875 AI820501 AI820532 W87891 TB5904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 

AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068AL1 34043 AW235363 AA663345 AW008282 AA4fl8964 
AA283144 AI890387 A1950344 AI741346 AI689062 AA28291 5 AW102898 AI872193 AI763273 AW1 73586 AW150329 AI653832 
A1762688 AA988777 AA488892 AI356394 AW1 0381 3 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA1 80009 AA337499 AW961 101 AA251669 AA251874 
3 0 A181 9225 AW205862 AI683338 A1858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 

AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W371 81 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F=04005 
123533 genbank_AA608751 AA608751
116480 genbank_C14088 C14088
132225 genbank_AA128980 AA128980
125154 genbank_W38419 W38419
118475 genbank_N66845 N66845

102919 25160 2 M21191 AL035748 AA021266 AA323126 AA180515 AI613029 D28356 NM.000034 M11560 AW401425 AW248248 AA012956 
AA323294 W04965 H38759 AA206622 AA580747 AI541172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 

40 AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331367 

AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AU)38124 AA243300 C03242 AA315615 AL035840 R64336 AA313917 AA018963 AA001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 AA348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 AA085425 AA481708 

45 AA313637 AL039229 H84490 H86153 F00656 AA326668 AA347304 R65890 H41949AA339309 AW402002 AW404854 

AA192582 AA1 12802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H85911 AA310414 H93436 N87014 
AA001 186 H83640 AA41 1328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 AI366131 H14328 AA197161 
AA379497AA311816AA017206AAD01137AA017420AA012990AW163775AA021397AA295513AA355248AA374921 
AW380419 AA345864 AA3 1 8058 AA371 711 AA363255AA057094 R88057 AA394045 AW362741 AA479579 AW362789 

50 AW362775 AA223624 AW362809 AW362746 AW362753 AW380412 AA057104 AA295271 X06352 AW362771 X12447 R59553 

R87925 AA292864 R83920 F01 120 AA496740 AA852551 AL039209 AA093339 AA333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 AA330784 AI908970 N89223 AW361938 AI940063 AA134323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AI074079 F291 18 AA852940 F35696 AA345963 
AA079578 AA113785 C02989 AA095945 003145 C05199 AA34eQ24 AA190506 AW361659 A1909d45 AW374374 AW374382 

55 AW374401 AW374373 AW374370 A1909831 AW374367 AA353658 F01Q41 C02843 AA375948 AW374414 AA213946 AI525039 

H13744 R31007 AA112044 AA134404 H47935 AW177018 AA429768 AA336873 AA112875 H46393 AA191267 D59131 
AW406037 AA055244 AA341880AA179024AA308537AW406985AW327311 H30301 AA300705 H43788AW364149 
AA806213 AA481936 C04941 AW375299 M21 190 AA41 0818 AA250940 A1354547 AA317422 AA250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 F00308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 

60 AW376830 AA383446 F01259 AW081388 T94035 AA379643 H43842 AI524063 AA621727 AA379099 AA371417 R6681 1 

H44129 N84794 F01135 AA477852 AA293062 AW361595 H27194 A1831650 H43253 H24797 AI564680 AA380090 W20057 
Atg21586 AI192549 AW0gO808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 HSeOOO C03180 F00946 
C03986 AI318091 AI860172 AA582179 AI633388 AA557193 R68075 F24105 AW518239 W56622 AI625219 AI925243
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 A1680458 AA1 59956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW51 4329 AA010455 
AA563928 AI571596 AI128394 W73707 AI423575 AA583809 AA657988 A1950837 AA169782 AA600009 AI885540 AA771884 
AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 AI905695 AA4801 15 AA574051 
AI889185 AA773167 AA331375 AA001437 AA1 94324 AA194300 AA558632 AI038538 AA41 1 329 AA781570 AI833176 
AA935520 AW074197 AA583063 AW073099 AW001 198 AA948025 AA587857 AA191540 AI460085 AA193244 AI538037 
AA51 5572 AA758587 AI14931 1 AA50861 0 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 D25663 
AA781985 AA284536 AI819422 H15040 H27531 AA456564 AA845555 AI423596 AA012908 AA889439 AA71631 1 AA958868 
AA320508 AA725731 AAfl34202 AA935997 AA72481 5 AA769353 AA594803 F00827 AI342442 AI003519 AI002503 AI347597 
A1040946 AA197162 AA987883 AA292865 AW001944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483688 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 A1368748 A1124097 AI880086 AA477513 AI758834 AI690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 A1569314 AA961665 AI922050 AI759000 AA555236 AA514432 
AA293474 AA001129 AA826789 AA641390AA1 34405 F35585 AA477416 AW1933S9 A)361315 AA284988 F36340 AI361322 
F26g69 AA991922 AA021267 F26973 AI361314 F35691 AI918509 AA250964 AA190992 AA577139 AA865535 AA134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
AI830942 AA113195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857063 AA017259 
AA179789AA088908 H43704AA194320 F35950 AI880127 F20441 F32878AA962483H39094H 56489 H44621 F19390 
AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA079530 H28106 AA243301 
AI025737 AA101239 AA088887 A1214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529AI811549AA665180AA708200 F01124 F32382 AA346220 AA627361 F30741 F30010F28543AA211715 F20245 
AA331222 F25634 F21996 W28215 F21911 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 
AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F2841 1 
H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AI926215 AA464185 F18217 N26193 AU)43256 T41197 F33055 F00386 F29500 F34191 F33297 
A1937207 F22724 F15909 F26232 F18889 AA318627 F29085 AA872i04 F17509 F23373 F15660 F17552 F17412 F16863 
F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 
F34230 F33258 F20860 F17998 AI695701 



129982 22L267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 [\/O4024 AA327072 Z14166 Z14167 Z14165 



AW403806 Z14200 AA383972 Z14205 Z14201 M1851 3 Z14202 AW403684 X14584 AR)62221 U43760 X65892 X65883 X62107 
Z80847 X65B85 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF0621 77 AF062222 APD62115 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 XB1748 AF174019 AF062285 L06924 L11699 AW402665 L09070 128049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 101412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF1 27792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021 536 AB02151 9 
Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 A'F103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 ID4323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 UB6801 AF103150 AJ010334 AF035027 AJ007327 AF1031 15 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403B03 AW406702 
AW351514 D7B345 T29140 J00231 NNL002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 A1709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T981 13 



118600 genbank_N69222 N69222
118952 genbank_N92966 N92966
120873 genbank_AA358015 AA358015
113702 genbank_T97307 T97307
115763 genbank_AA421560 AA421560
124357 genbank_N22401 N22401
108733 504187_1 AA121022 AA126422
101544 entrez_M31169 M31169
124447 genbank_N48000 N48000
124677 genbank_R01073 R01073
124777 genbank_R41933 R41933
119302 genbank_T25725 T25725



135029 

112253 

Pkey 

322175 

323011 

322975 

317210 

323817 

309583 

324961 

303502 

320133 

311935 

321050 

319977 

312772 

321354 

336512 

336558 

336560 

329367 

329373 

336676 

338008 

338057 

329655 

336959 

329899 

329960 

338410 

338451 

338588 

338665 

338689 

308677 

338832 

338980 

333009 

333010 

333013 

308981 

308995 

333108 

333139 

333254 

333305 

333343 

333388 

326213 

333456 

333459 

333517 

333585 

333679 

326474 

333743 

333758 

333767 

333768 

333769 

333795 

333796 

335004 

333892 

335115 

333904 



entrez Z93784Z93784 

H5881d_at H58818 

genbanIeR51818 R51818 

CAT number Accession 

46877 1 AF085975 H53458 H53459 

139750 1 AA580288 AA315655 AA133031 AA377748 

1510563 1 C16391C16413 

211994 1 AW881145AA490718M85637AA304575 T06067AA331991 
233555 1 AA410943AW948953AA334202AA332882 

1046029^-2 AW170035 

376239J AA613792 AW182329 T05304 AW858385 

325188 1 BE174240AA488528AL042253 

447553 J BE151746 BE336853 D63271 T94955 AA774994 

174129 J AA216387 163548 AA228676 

502195 1 C05928AW393497 

345248 1 AA534222AA632632 T81234 

4380 7 AW962299 AA310349 AW962294 H63791 H63751 

116028^-2 AA078493 

CH22^3941FG^834_7_L1NK^DJ 

CH2^3992FG^842JJ.1NI^DJ 

CH22„3994FG_842^5_LINK.DJ 

c_x_hs 

c_x_hs 

CH22_4154FG^43J_ 

CH22 6490FG_LINK.EM:AC00 

CH22_6558FG_UN1CEM:AC00 

c14j)2 

CH2^_4764FG.367_13. 

c15j>2 

c16j)2 

CH2a„7067FG_LINK^EM:AC00 
CH22„7124FG_UNK_EM:AC00 
CH22_7331FG_UNK_EM:AC00 
CH22 7438FG_LINK-EMAC00 
GH22^7464FG_UN1^EM:AC00 
A1761173 

CH22 7678FG_UNICDJ246D7 

CH22l7859FG_LlNK DA59H18 

CH22^233FG_6LLL1NK_EM:A 

CH22_234FG_6U«L(NK.EM-;^ 

CH22^237FG.61_5^L!NK.EM:A 

AI873242 

AI880172 

GH22_336FG_79J4_LINK«EM: 
CH22_358FG_83_16.L!N}eEM: 
CH22.495FG_1 18 J.LINK^EM: 
CH22_550FGJ37^LINK_EM: 
CH22.589FG J 39J2_LINK,EM 
CH22^634FG_144J.UNK.EM: 
c17Jis 

CH22_706FGJ57_5_UNK^BV): 
CH22_709FG^157J_UNHLEM: 
CH22„773FG^173_2^LINK^EM: 
CH22.846FGJ03J^INICEM: 
CH22_941 FG_247 J^LINICEM: 
c19 hs 

CH22.1009FGJ64_1_L1NK^EM 
CH22_1024FG_268_LL1NK^EM 
CH22^1034FGJ71_6 LINK^EM 
CH22J035FG_271 7 LtNK^EM 
CH22 1036FG_271_8_L1NK„EM 
CH22J063FGJ75JJ.INK.EM 
CH22J065FGJ75_3„LINK_EM 
CH22_2326FGJ72J_UNK^EM 
CH2?_1 163FG_292_14 J^INK^E 
CH22_2447FG„496^LINieEM 
CH2a«1 176FG.294 J_LINK_EM 
333905 CH22J177FGJ94XLINK.EM
333921 CH22_1194FGJ96.1UINieE 
333968 CH2^1245FG_307jLUNICEM 
328059 c 6 hs 
5 335287 CH^_2629FG,526J1_L1NK_E 

326816 c20_hs 

326817 c20 hs 

335342 CH22J689FGJ36J_LINK^EM 
335491 CH22 2843FGJ70J3_UNK E 
10 335495 CH22I2847FG_570J8_LINKIe 
335498 CH22_2850FGJ71J.LINK-EM 
328304 cj hs 
305453 AATO110 

335544 CH22J899FG_576J_LINfeEM 
15 335610 CH22_29e9FGJ83J_LINK.EM 
335653 CH2Z.3013FG_590.4_LINK.EM 
335682 CH22.3043FG_595.2.LINK-EM 
335687 CH22.3048FG_596J_LINK.EM 
328492 c_7_hs 
20 335755 CH22_3122=G_604_4_LINK^EM 
335782 CH22.3151FG_609_4_LINK^EM 
335791 CH22 3160FG 611 7 LINK.EM 
335809 CH22J181FGJ17J^LINK_EM 
335822 CH2L3195FG 619 7_L1NK EM 
25 335823 CH22.3196FGJ19J^LINK[eM 

335824 CH22_3197FG_619„11_LINK^E 

335825 CH22L3198FG_619_12.LINK-E 
335895 CH22_3272FG_635_3_LlNf<LEM 
335917 CH22_3294FG_636J3_LINK.E 

30 335920 CH22L.3297FG.636.16.L1NK-E 

305898 AA872838 

305913 AA87610g 

305950 AA884479 

328857 c_7Jis 
35 330084 c19_p2 

337958 CH22_6419FG_L1NK.EMAC00 

309177 AI951118 

309198 AI955915 

309226 AI969897 
40 339352 CH22_8317FG_L1NK^BA354I1 

309279 AI990102 

339373 CH22 834aFG_LINK^BA232E1 
325622 c14Js 

334102 CH22.1380FG_327.60_LINK^ 
45 332927 CH22.148FG_38J_L1NK_C20H 

332929 CH22 150FGJ8_3_LINK_C20H 

332930 CH22ll51FG 38_4_LlNieC20H 
3^955 CH22 179FGJ8_12_LINK»EM: 
332958 CH22~182FGJ8J5^L1NK.EM: 

50 332961 CH22 185FG 48J8_L1NK.EM: 
332983 CH22_207FGJ4J_LINK-EM:A 

334222 CH22_1506FG_360XL(NK_EM 

334223 CH2Z.1507FG_360_4_LINK_EM 
334264 CH22_1551FG_367_15_L1NK.E 

55 327110 c21Jis 

334343 CH22_1636FG_375J5.LINK„E 

334360 CH22_1654FG_378XLINKLEM 

327196 c 1 hs 

327283 c 1 hs 
60 327313 c.1_hs 

304465 AA421948 

304507 AA456426 

327450 c_2^hs 

304591 AA505702 
65 304601 AA507875 

304659 AA533185 

334784 CH2?.2096FG_432_9_LINieEM 
334789 CH22J101F6_432.1U1NK.E
334794 CH22J106FG 434J_LINieEM 
336035 CH22 3420FG_678XL1NK.DJ 
336042 CH22„3427FG 679AUNKJ)J 
5 336093 CH22_3481FGJ91J_LINHLDJ 
336096 CH22 3484FG_691_5J.INK_DJ 
334889 CH22J206FGJ52XLINK^M 
336150 CH22J540FG 706 6_LINK_DA 
336152 CH22 3543FGJ05 9 UNK^DA 
10 336416 CH22_3833FGJ23J8_LINK^ 
336444 CH22^3864FG327J0_LINK^D 
336449 GH22.3870FGJ29_6_L1NKJ>J 
336471 CH22^3894FG 829_30J.INIO> 
TABLE 13B



Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and
accession numbers in Table 13. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

332955 Dunham, I. et al. Plus 2508896-2508992
332958 Dunham, I. et al. Plus 2515164-2516310
332961 Dunham, I. et al. Plus 2521424-2521555
333139 Dunham, I. et al. Plus 3369495-3369571
333254 Dunham, I. et al. Plus 2521424-2521555
333305 Dunham, I. et al. Plus 4630388-4630645
333388 Dunham, I. et al. Plus 4913749-4913805
333517 Dunham, I. et al. Plus 5570729-5570925
333585 Dunham, I. et al. Plus 6234778-6234894
333679 Dunham, I. et al. Plus 7068795-7068896
333767 Dunham, I. et al. Plus 7694407-7694623
333768 Dunham, I. et al. Plus 7695440-7695697
333769 Dunham, I. et al. Plus 7696625-7696707
333795 Dunham, I. et al. Plus 7807688-7807795
333796 Dunham, I. et al. Plus 7808253-7808319
333892 Dunham, I. et al. Plus 8156825^157001
333921 Dunham, I. et al. Plus 8380325^80441
333968 Dunham, I. et al. Plus 8681004-8681241
334102 Dunham, I. et al. Plus 9995140-9996373
334264 Dunham, I. et al. Plus 13234447-13234544
334343 Dunham, I. et al. Plus 13655828-13656307
334794 Dunham, I. et al. Plus 16374312-16374458
334889 Dunham, I. et al. Plus 19286024-19286515
335287 Dunham, I. et al. Plus 22299047-22299299
335491 Dunham, I. et al. Plus 24128651-24128827
335495 Dunham, I. et al. Plus 24140688-24140872
335498 Dunham, I. et al. Plus 24172082-24172161
335653 Dunham, I. et al. Plus 25329710-25329802
335687 Dunham, I. et al. Plus 25445952-25446064
335809 Dunham, I. et al. Plus 26310772-26310909
335822 Dunham, I. et al. Plus 26364087-26364196
335823 Dunham, I. et al. Plus 26365925-26366004
335824 Dunham, I. et al. Plus 26376860-26376942
335825 Dunham, I. et al. Plus 26378175-26378268
336035 Dunham, I. et al. Plus 29016748-29017410
336093 Dunham, I. et al. Plus 29555922-29557002
336096 Dunham, I. et al. Plus 29578878-29579047
336444 Dunham, I. et al. Plus 3419058&-34190718
336959 Dunham, I. et al. Plus 13233040-13233126
338008 Dunham, I. et al. Plus 7697068-7697236
338057 Dunham, I. et al. Plus 8526397-8526522
338410 Dunham, I. et al. Plus 19292807-19292916
338588 Dunham, I. et al. Plus 22896767-22896920
338665 Dunham, I. et al. Plus 24472654-24472853
338832 Dunham, I. et al. Plus 27775128-27775290
338980 Dunham, I. et al. Plus 29896789-29896874
339352 Dunham, I. et al. Plus 33544784-33545121



332929 
332930 
332983 
333009 
333010 
333013 
333108 
333343 
333456 
333459 
333743 
333758 
333904 
333905 
334222 
334223 
334360 
334784 
334789 
335004 
335115 
335342 
335544 
335610 
335682 
335755 
335782 
335791 



335917 
335920 
336042 
336150 
336152 
336416 
336449 
336471 
336512 
336558 
336560 
336676 
337968 
338451 



339373 
325622 
329655 
329899 
329960 
326213 
326474 
330084. 
326815 
326817 
327110 
327196 
327283 
327313 
327450 
328059 
328492 
328304 
328857 
329367 
329373 



Dunham. I. elal. 
Dunham, I. et.al. 
Dunham, I. eta). 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, i. ela). 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham. 1. etat 
Dunham, 1. etal. 
Dunham, L etal 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, tetal. 
Dunham, I. ^.al. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etat 
Dunham, I etat 
Dunham. I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham. 1. etal. 
Dunham, 1. etal. 
Dunham, 1. etal. 
Dunham. I. etal. 
Dunham, I. et.at 
Dunham. I. etal 
Dunham, I. et.al. 
Dunham, I. et.al. 
Dunham, t etat 
Dunham, 1. et.al. 
Dunham, 1. et.al. 
Dunham, 1. etat 
Dunham, t etal. 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etat 
Dunham, I. etal. 
Dunham, I. etal. 
5867000 
6448516 
6563505 
5091594 
5867224 
5867405 
6015302 
6552458 
6552458 
6117842 
5867446 
5867478 
5867501 
5867766 
6117819 
5868455 
6004478 
6381927 
5868842 
6682537 



Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 



2020758-2020664 

2022565-2022497 

2631933-2631797 

2766043-2765856 

2766207-2766119 

2772278-2772039 

3240494-3240389 

46928864692753 

2631933-2531797 

5144548-5144344 

7573218-7573060 

7666413-7666091 

8217374-8217261 

8217796-8217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305996 

20581911-20581794 

21388250-21388146 

22597448-22597284 

24650505-24650403 

25068943-25068841 

25421215-25421093 

25763806-25753747 

25908578-25908440 

2594856^25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

30150423-30150256 

30156053<30155870 

34047408-34047311 

34204707-34204577 

34215091-34214978 

34278373-34278275 

34375825-34375698 

34376814-34376596 

2022565-2022497 

7095797-7095580 

20174286-20174193 

24893073-24892972 

33860127-33860047 

69994-70075 

35555-35843 

111058-111783 

1031-1162 

60751-60927 

16995-18101 

57019-59337 

198354-198436 

199909-200001 

94608-94785 

180921-181333 

567-962 

89734-89838 

4792848076 

37052-37204 

4609446241 

3884^952 

80557-81051 

87201-67587 

38950-39301 
TABLE 14: Table 2 from BRCA 001-5 US

Table 14, a subset of Table 13, depicts a preferred group of genes highly upregulated in breast
cancer cells.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue



Pkey ExAccn UniGene ID Unigene Title R1

100038 M97935 AFFX control: STAT1 167
100114 D00596 Hs.82962 thymidylate synthetase 15.9
100975 J02923 Hs.76506 lymphocyte cytosolic protein 1 (L-plastin) 30.1
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B; 9 37.2
101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPYY 18.3
101143 L12723 Hs.90093 heat shock 70kD protein 4 17.4
101332 L47276 Hs.156346 Homo sapiens (cell line HL-6) alpha topois 18.9
101378 M13755 Hs.833 interferon-stimulated protein; 15 kDa 18.1
101609 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 22.5
102618 U65932 Hs.81071 extracellular matrix protein 1 23.2
102721 U79241 Hs.118666 Human clone 23759 mRNA; partial cds 15
102817 U90904 Hs.83724 Human clone 23773 mRNA sequence 15.2
102907 X06985 Hs.202833 heme oxygenase (decycling) 1 22.7
102985 X17644 Hs.2707 G1 to S phase transition 1 20.6
103060 X57766 Hs.155324 matrix metalloproteinase 11 (stromelysin 3 17.8
103180 X69433 Hs.5337 isocitrate dehydrogenase 2 (NADP+); mito 18.9
103206 X72755 Hs.77367 monokine induced by gamma interferon 15.1
103821 AA157623 Hs.198793 KIAA0750 gene product 23.3
104115 AA428090 Hs.25102 ESTs 28.7
104667 AA007234 Hs.30098 ESTs 16.6
105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564G 19.3
106103 AA421104 Hs.12094 ESTs 15.4
107151 AA621169 Hs.8687 ESTs 19
109415 AA227219 Hs.110826 trinucleotide repeat containing 9 20.1
110189 H20543 Hs.6278 DKFZP586B1621 protein 16.6
110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CON 19.5
110734 H98714 Hs.24131 ESTs 30.2
110915 N46252 Hs.29724 ESTs 23.2
111179 N67239 Hs.10760 ESTs 37
111357 N91023 Hs.87128 ESTs 15
112134 R46025 Hs.7413 ESTs 17.4
113970 W86748 Hs.8109 ESTs 15
114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 protein 22
114292 Z40715 Hs.184641 delta-6 fatty acid desaturase 19.4
114901 AA236276 Hs.196437 ESTs; Weakly similar to R26660 1; partial 16.9
114965 AA250737 Hs.72472 ESTs 35.1
115652 AA405098 Hs.38178 ESTs 16.1
115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity I 33.5
116790 H29532 Hs.101174 microtubule-associated protein tau 22.2
116921 H72948 Hs.821 biglycan 20.7
117412 N26722 Hs.42645 ESTs 18.1
120241 Z41815 Hs.65946 ESTs 15.6
120325 AA195651 Hs.104106 ESTs 15.2
121596 AA416740 Hs.174104 ESTs 22.6
123619 AA609200 ESTs 23.1
124006 D60302 Hs.270016 ESTs 20.6
125852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B 25.9
126160 N90960 Hs.265398 ESTs; Weakly similar to transformation-rel 16.4
127677 AA916752 Hs.264190 ESTs; Highly similar to MEM3 [M.muscul 17^
126595 U31675 Hs.152677 short-chain alcohol dehydrogenase family m 27.1
126717 T30617 Hs.104222 Homo sapiens mRNA; cDNA DKFZp566L 24.5
129124 AA234530 Hs.108802 N-ethylmaleimide-sensitive factor 20.7
129366 H18027 Hs.184697 plexin C1 18.2
130455 X17059 Hs.155956 N-acetyltransferase 1 (arylamine N-acetylt 26.4
130604 X03635 Hs.1657 estrogen receptor 1 39.9
130913 W03592 Hs.21198 translocase of outer mitochondrial membra 20.9
130944 M97935 Hs.21486 signal transducer and activator of transcript 18.8
131472 AA608952 Hs.27258 calcyclin binding protein 18.1
131562 U90551 Hs.28777 H2A histone family; member L 18.8
132180 AA405569 Hs.418 fibroblast activation protein; alpha; seprase 15.4
132406 F09979 Hs.4774 ESTs 15
132465 AA047895 Hs.49169 ESTs 15.4
132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose t 26.4
133294 R79723 Hs.69997 zinc finger protein 238 30.4
133634 U24166 Hs.234279 microtubule-associated protein; RP/EB fam 15.2
134374 D62633 Hs.8236 ESTs 15.2
134405 J04177 Hs.82772 collagen; type XI; alpha 1 15.3
134470 X54942 Hs.83758 CDC28 protein kinase 2 20.3
134495 D63477 Hs.84087 KIAA0143 protein 16.1
134714 U89922 Hs.890 lymphotoxin beta (TNF superfamily; memb 35.7
135237 AA454930 Hs.9691 ESTs 193
301884 AA312082 Hs.105445 GDNF family receptor alpha 1 207
302276 NM_004448 Hs.323910 EST cluster (not in UniGene) with exon hit 21.6
302290 AL117607 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N 41.4
309177 AI951118 EST singleton (not in UniGene) with exon 24.3
309583 AW170035 EST 64.5
310438 AW022192 Hs.200197 ESTs 39.1
311166 AI821294 Hs.118599 ESTs 24.1
312153 AA759250 Hs.153028 cytochrome b-561 27.1
313915 AI969390 Hs.163443 ESTs 27.1
314506 AA833655 Hs.206868 ESTs 27.8
314558 AI873274 Hs.190721 ESTs 725
314691 AW207206 Hs.136319 ESTs 21.4
314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 18.4
315196 AA972756 Hs.44898 ESTs 28.8
316177 AI908272 Hs.293102 EST cluster (not in UniGene) 32.6
318073 AW167087 Hs.131562 ESTs 157
318662 AI285898 Hs.294014 ESTs 16.3
318740 NM_002543 Hs.77729 EST cluster (not in UniGene) 21.3
318744 AI793124 Hs.144479 ESTs 35
319668 NM_002731 Hs.87773 EST cluster (not in UniGene) 25.4
320074 AA321166 Hs.278233 EST cluster (not in UniGene) 167
320211 AL039402 Hs.125783 DEME-6 protein 24.3
320727 U96044 Hs.181125 EST cluster (not in UniGene) 15.3
322818 AW043782 Hs.293616 ESTs 21
322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2 153
324261 AL044891 Hs.269350 EST cluster (not in UniGene) 50.1
324432 AA464510 Hs.152812 EST cluster (not in UniGene) 167
324603 AW016378 Hs.292934 ESTs 23.1
324620 AA448021 Hs.94109 EST cluster (not in UniGene) 2U
324988 T06997 Hs.121028 EST cluster (not in UniGene) 24.5
330388 X03363 HER2 receptor tyrosine kinase (c-erbB-2;E 17.7
330486 M13755 Hs.833 interferon-stimulated protein; 15 kDa 67
330814 AA015730 Hs.265398 ESTs; Weakly similar to transformation-rel 44.1
331145 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME 41.9
331306 AA252079 Hs.63931 dachshund (Drosophila) homolog 15.1
331890 AA432166 Hs.3577 succinate dehydrogenase complex; subunit 24.3
332526 AA281753 Hs.77515 inositol 1;4;5-triphosphate receptor; type 3 19
332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHANO 153
332694 AA262768 Hs.243901 KIAA1067 protein 15.2
332958 CH22_FGENES.48_15 17.8
333769 CH22_FGENES.271_8 48.3
333968 CH22_FGENES.307_4 155
334223 CH22_FGENES.360_4 33.5
334264 CH22_FGENES.367_15 185
335791 CH22_FGENES.611_7 27.3
336512 CH22_FGENES.834_7 21.4
338008 CH22_EM:AC005500.GENSCAN.127-9 15.2
TABLE 14A



Table 14A shows the accession numbers for those pkeys lacking unigene ID's for Table 14.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

309583 1046029_-2 AW170035
336512 CH22_3941FG_834_7_LINK_DJ
338008 CH22_6490FG_LINK_EM:AC00
333769 CH22_1036FG_271_8_LINK_EM
333968 CH22_1245FG_307_4_LINK_EM
335791 CH22_3160FG_611_7_LINK_EM
309177 AI951118
332958 CH22_182FG_48_15_LINK_EM:
334223 CH22_1507FG_360_4_LINK_EM
334264 CH22_1551FG_367_15_LINK_E
123619 371681_1 AA602964 AA609200
TABLE 14B

Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and
accession numbers in Table 14. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



332958 Dunham, I. et al. Plus 2516164-2516310
333769 Dunham, I. et al. Plus 7696625-7696707
333968 Dunham, I. et al. Plus 8681004-8681241
334264 Dunham, I. et al. Plus 13234447-13234544
338008 Dunham, I. et al. Plus 7697088-7697236
334223 Dunham, I. et al. Minus 12734365-12734269
335791 Dunham, I. et al. Minus 25948563-25948411
336512 Dunham, I. et al. Minus 34278373-34278275
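
As an illustration of how the Strand and Nt_position columns above can be read together, the following Python sketch normalizes a position range and computes the exon length. It assumes that positions are inclusive and that Minus-strand rows list the higher coordinate first, as the rows above suggest; the table itself does not state a coordinate convention. The names `Exon` and `parse_exon` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Exon:
    pkey: int
    strand: str  # "Plus" or "Minus", as given in the Strand column
    start: int   # lower genomic coordinate
    end: int     # higher genomic coordinate

    @property
    def length(self) -> int:
        # Assumption: both endpoints are included in the exon.
        return self.end - self.start + 1


def parse_exon(pkey: int, strand: str, nt_position: str) -> Exon:
    """Parse an Nt_position such as '12734365-12734269'."""
    if strand not in ("Plus", "Minus"):
        raise ValueError(f"unexpected strand: {strand!r}")
    first, second = (int(part) for part in nt_position.split("-"))
    start, end = sorted((first, second))
    return Exon(pkey, strand, start, end)


minus_exon = parse_exon(334223, "Minus", "12734365-12734269")
plus_exon = parse_exon(333769, "Plus", "7696625-7696707")
print(minus_exon.start, minus_exon.end, minus_exon.length)
print(plus_exon.length)
```

Storing the lower coordinate as `start` regardless of strand keeps length and overlap calculations uniform, while the `strand` field preserves the orientation reported in the table.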
TABLE 15: Table 3 from BRCA 001-5 US
Table 15 shows genes downregulated in breast cancer cells. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor
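
The R1 column is defined in opposite directions in Table 14 (tumor to normal) and Table 15 (normal to tumor). The Python sketch below shows both ratios side by side. It assumes a single positive expression value per tissue type; how expression values were summarized across samples is not stated here, so the inputs and function names are purely hypothetical.

```python
def expression_ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, rejecting non-positive denominators."""
    if denominator <= 0:
        raise ValueError("denominator expression value must be positive")
    return numerator / denominator


def r1_upregulated(tumor: float, normal: float) -> float:
    # Table 14 convention: ratio of tumor to normal breast tissue.
    return expression_ratio(tumor, normal)


def r1_downregulated(tumor: float, normal: float) -> float:
    # Table 15 convention: ratio of normal breast tissue to tumor.
    return expression_ratio(normal, tumor)


# Hypothetical expression values, for illustration only.
up = r1_upregulated(tumor=30.0, normal=2.0)
down = r1_downregulated(tumor=2.0, normal=5.0)
print(up, down)
```

With these conventions a larger R1 always indicates a stronger effect in the direction the table reports, whether upregulation or downregulation.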



Pkey ExAccn UniGene ID Unigene Title R1 

100115 D00632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7
100499 TIGR:HT1428 Hs.283108 Globin, Beta 1.5
100502 TIGR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3
100815 TIGR:HT4268 Hs.9739 L-Glycerol-3-Phosphate:Nad+ Oxidoreduct 1.7
101125 L10373 Hs.82749 transmembrane 4 superfamily member 2 1.5
101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po 2.9
101397 msm Hs.180878 lipoprotein lipase 1.6
101883 M98399 Hs.75613 CD36 antigen (collagen type I receptor; thr 1.6
102227 U25138 Hs.93841 potassium large conductance calcium-acti 1.6
102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3
103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8
103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5
103562 Z21966 Hs.2815 POU domain; class 6; transcription factor 1 1.8
104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 2.4
105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-IND 1.7
105138 AA164519 Hs.15248 ESTs 1.5
106075 AA417915 Hs.25930 ESTs 1.5
106670 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6
107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7
107616 AA004901 Hs.261164 ESTs 1.6
107997 AA037388 Hs.82223 Human DNA sequence from clone 141H5 o 1.7
108604 AA099820 Hs.49696 ESTs 2.4
111130 N64265 Hs.19515 yz44h12.s1 Morton Fetal Cochlea Homo sa 1.7
111837 R36447 Hs.24453 ESTs 1.6
112538 R70255 ESTs 1.9
112808 R97970 Hs.281022 EST 1.5
113086 T40652 Hs.209100 DKFZP434G171 protein 1.9
115740 AA418033 Hs.283559 ESTs 1.6
115949 AA443800 Hs.43125 ESTs 2
115965 AA446661 Hs.173233 ESTs 2.2
117224 N20300 Hs.218707 ESTs 1.7
117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 1.7
119059 R15436 Hs.77889 Friedreich ataxia region gene X123 1.7
119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activato 2.8
119359 T71021 Hs.285681 ESTs; Highly similar to WS baslo^-loo 1.9
119798 W73386 Hs.249129 ESTs 3
120889 AA365784 Hs.97044 ESTs 1.6
121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8
121750 AA421184 Hs.97549 ESTs 1.5
122127 AA434447 Hs.106771 ESTs 2.5
122348 AA443695 Hs.293410 ESTs 2.1
122485 AA448300 Hs.150318 phospholemman 1.5
123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylate cy 1.8
123505 AA600135 ESTs; Moderately similar to !!!! ALU SUB 1.5
125284 W94688 Hs.103253 perilipin 1.7
126300 D81972 HUM427D08B Human fetal brain (TFujiw 1.8
126747 R72515 Hs.160318 phospholemman 1.6
127218 AA309765 Hs.116017 ESTs; Weakly similar to KIAA0795 protei 1.5
127357 AA452788 Hs.75432 zx39g1U1 Soares_total_fetus_Nb2HF8_9 1.7
127638 AA634405 Hs.122608 ESTs 1.5
128213 AA972780 Hs.129194 ESTs; Weakly similar to ALU SUBFA 1.5
128351 AI092391 Hs.134886 ESTs U
128842 N44757 Hs^0340 ESTs 1.6
128870 R71403 Hs.75309 eukaryotic translation elongation factor 2 1.7
129146 AA459944 Hs.108924 DKFZP586P1422 protein 1.5
129285 T62068 Hs.11006 ESTs 2.1
129331 N93465 Hs.279772 ESTs; Highly similar to CGW8 protein (H 1.5
130085 M62402 Hs.274313 insulin-like growth factor binding protein 6 1.7
130400 M25079 Hs.283108 hemoglobin; beta 1.7
131267 AA211776 Hs.2504 myomesin 1 (skelemin)(185kD) 3.8
131277 AA131466 Hs.23767 ESTs 1.9
131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma p 2.2
131304 AA295848 HsJ25475 aquaporin 7 1.7
131810 D49487 Hs.194236 leptin (murine obesity homolog) 2.5
132788 AA045503 Hs.56874 ESTs; Weakly similar to Homo sapiens p2 1.6
132931 Z41452 Hs.6090 deleted in bladder cancer chromosome regi 1.5
133120 X64559 Hs.65424 tetranectin (plasminogen-binding protein) 2
133314 U95367 Hs.70725 gamma-aminobutyric acid (GABA) A recep 1.5
133507 X74295 Hs.74369 integrin; alpha 7 1.7
133601 S95936 Hs.284176 transferrin 2.3
133702 N56898 Hs.75652 glutathione S-transferase M5 1.9
134111 N79674 Hs.8022 TU3A protein 4.6
134699 U56814 Hs.88646 deoxyribonuclease I-like 3 1.5
134749 L10955 Hs.89485 carbonic anhydrase IV 1.6
135173 M72885 Hs.95910 Human G0S2 protein gene; complete cds 1.9
300132 AW027556 Hs.156286 ESTs 1.7
300732 AI369956 Hs.257891 ESTs 1.5
300750 AA514805 Hs.293055 ESTs 1.8

301140 AI807692 Hs.129129 ESTs 1.6
301396 AA923549 Hs.224121 ESTs 2.1
302910 N77976 Hs.251577 hemoglobin; alpha 1 1.8
303798 V00505 Hs.36977 hemoglobin; delta 1.6
303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7
303844 U94362 Hs.58589 glycogenin 2 1.5
304182 H91086 EST singleton (not in UniGene) with exon 1.5
304622 AA516384 EST singleton (not in UniGene) with exon 1.5
304682 AA550994 EST singleton (not in UniGene) with exon 1.7
305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5
306193 AA923457 EST singleton (not in UniGene) with exon 1.5
307206 AI192534 EST singleton (not in UniGene) with exon 1.6
307377 AI222691 EST singleton (not in UniGene) with exon 1.5
308023 AM52732 Hs.251577 EST singleton (not in UniGene) with exon 1.9
308359 AI612774 Hs.79372 retinoid X receptor; beta 1.5
309838 AW296073 Hs.255504 EST 1.5
310403 AI720978 Hs.148006 ESTs; Moderately similar to alternatively s 1.8
311671 AW241947 Hs.232478 ESTs 1.6
311794 AW238092 Hs.254759 ESTs 2.1
312082 T79860 Hs.118180 ESTs 1.9
312575 H25237 Hs.306814 ESTs 2.3
313076 N49684 Hs.143040 ESTs 1.8
313283 W32480 Hs.157099 ESTs 2.2
313374 AW328672 Hs.132750 ESTs 1.9
314701 AI754634 Hs.131987 ESTs 1.7
315391 AA759098 Hs.192007 ESTs 1.8
315688 AA680055 Hs.264885 ESTs 1.5
316249 AA948612 Hs.130414 ESTs 1.6
316586 AI205077 Hs.294085 ESTs 1.7
316890 AA837079 Hs.24647 ESTs 1.5
316983 AI480204 Hs.177131 ESTs 1.5
317604 AI650625 Hs.300756 ESTs 1.6
317951 AW205520 Hs.129621 ESTs 1.5
319400 W26902 Hs.154085 ESTs 1.7
320757 H22654 Hs.6382 EST cluster (not in UniGene) 1.5
321594 AA021402 Hs.11067 ESTs 1.7
322102 H45589 EST cluster (not in UniGene) 1.5
322814 AI824495 Hs.211038 ESTs 2.2
322929 AI365585 Hs.146246 ESTs 2.3
323831 AA335715 Hs.200299 ESTs 1.7
324044 AL045752 Hs.22350 ESTs 1.8
324675 AW014734 Hs.157959 ESTs 2.2
325272 CH.11_hs gi|5866902 1.5
325558 CH.12_hs gi|6056302 1.6
325656 CH.14_hs gi|6056305 1.6
326120 CH.17_hs gi|5867194 1.5
326139 CH.17_hs gi|5867203 1.5
326855 CH.20_hs gi|6552460 1.5
327438 CH.02_hs gi|6004454 1.6
329733 CH.14_p2 gi|6065783 1.6
330931 F01443 Hs.284256 ESTs 4.6
331591 N71677 Hs.42146 ESTs 1.9
332159 AA621393 Hs.112984 EST 1.5
332364 W94688 Hs.103253 perilipin 2.1
332502 H21819 Hs.14896 Homo sapiens clone 24590 mRNA sequenc 1.5
334175 CH22_FGENES.349_10 1.5
334347 CH22_FGENES.375_31 1.8
334737 CH22_FGENES.424_12 1.8
335352 CH22_FGENES.539_5 1.5
335639 CH22_FGENES.584_19 1.6
336244 CH22_FGENES.746_2 1.5
336336 CH22_FGENES.814_8 1.7
336855 CH22_FGENES.305-1 1.6
337494 CH22_FGENES.799-12 1.6
337764 CH22_EM:AC000097.GENSCAN.119-1 1.8
337983 CH22_EM:AC005500.GENSCAN.110-1 2
338192 CH22_EM:AC005500.GENSCAN^28-1 1.5
339366 CH22_BA354I12.GENSCAN^4-2 1.5
TABLE 15A

Table 15A shows the accession numbers for those pkeys lacking Unigene IDs for Table 15.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

126300 250375.2 D81972 68)03132
112538 504579.1 AA908813 R70255
123505 genbank.AA600135 AA600135
104672 6735 7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386
322102 46708^1 H45589 H19807 AF075038 H19808 H42437
336865 CH22_4590FG_305_1
338192 CH22_6755FG_LINK_EM:AC00
329733 c14_p2
326120 c17_hs
326139 c17_hs
326855 c20_hs
335352 CH22_2699FG_539_5_LINK_EM
335639 CH22_2999FG_584_19_LINK_E
307206 AI192534
307377 AI222691
337494 CH22_5727FG_799_12
337764 CH22_6115FG_LINK_EM:AC00
337983 CH22_6438FG_LINK_EM:AC00
339365 CH22_8336FG_LINK_BA354I1
325272 c11_hs
325558 c12_hs
325656 c14_hs
334175 CH22_1455FG_349_10_LINK_E
304182 H91086
334347 CH22_1640FG_375_3UII^E
327438 c2_hs
304622 AA516384
334737 CH22_2049FG_424_1UINICE
304682 AA550994
336244 CH22_3642FG_746_2_LINK_DA
306193 AA923457
336336 CH2U746FG.814J.UNK.BA
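
A cluster table laid out this way can be inverted so that any Genbank accession points back to its probeset. The sketch below is illustrative only: the helper `parse_cluster_row` is a hypothetical name, and it assumes that fields are separated by whitespace and that the CAT number itself contains no spaces.

```python
def parse_cluster_row(line):
    """Split a Table 15A row into (pkey, cat_number, accessions).

    Assumes whitespace-separated fields: the first is the Pkey, the second
    the CAT number, and any remaining fields are Genbank accessions.
    """
    fields = line.split()
    pkey, cat_number = fields[0], fields[1]
    return pkey, cat_number, fields[2:]


# Two rows copied from Table 15A.
rows = [
    "112538 504579.1 AA908813 R70255",
    "123505 genbank.AA600135 AA600135",
]

# Map every accession in a cluster back to the probeset designed from it.
pkey_by_accession = {}
for line in rows:
    pkey, cat_number, accessions = parse_cluster_row(line)
    for accession in accessions:
        pkey_by_accession[accession] = pkey

print(pkey_by_accession["R70255"])
```

Rows whose cluster spans several printed lines (such as Pkey 104672) would need to be joined into a single line before parsing.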
TABLE 15B

Table 15B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 15. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

334347 Dunham, I. et al. Plus 13663814-13663926
334737 Dunham, I. et al. Plus 15998517-15998685
335639 Dunham, I. et al. Plus 25173591-25173696
337494 Dunham, I. et al. Plus 33339024-33339148
334175 Dunham, I. et al. Minus 11668659-11668597
335352 Dunham, I. et al. Minus 22681512-22681384
336244 Dunham, I. et al. Minus 31402729-31402583
336336 Dunham, I. et al. Minus 33797209-33797076
336865 Dunham, I. et al. Minus 8622405-8622289
337764 Dunham, I. et al. Minus 4035640-4035446
337983 Dunham, I. et al. Minus 7275495-7275271
338192 Dunham, I. et al. Minus 13248453-13248277
339366 Dunham, I. et al. Minus 33647431-33647293
325272 5866902 Minus 13247-13312
325558 6056302 Plus 70930-71030
325656 6056305 Minus 7819CV78707
329733 6065783 Plus 163237-163450
326120 5867194 Plus 36116-36276
326139 5867203 Minus 218901-218960
326855 6552460 Minus 111390-111463
327438 6004454 Minus 199569-199692
TABLE 16: Table 4 from BRCA 001-5 US

Table 16, a subset of Table 15, depicts a preferred group of genes highly downregulated in
breast cancer cells.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor

Pkey ExAccn UnigeneID Unigene Title R1

100502 TIGR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3
101357 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha 2.9
102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3
104672 AA007629 glycerol-3-phosphate dehydrogenase 1 2.4
107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7
108604 AA099820 Hs.49696 ESTs 2.4
115949 AA443800 Hs.43125 ESTs 2
115965 AA446661 Hs.173233 ESTs 2.2
119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activator 2.8
119798 W73386 Hs.249129 ESTs 3
122127 AA434447 Hs.106771 ESTs 2.5
122348 AA443695 Hs.293410 ESTs 2.1
129285 T62068 Hs.11006 ESTs 2.1
131267 AA211776 Hs.2504 myomesin 1 (skelemin)(185kD) 3.8
131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma 2.2
131810 D49487 Hs.194236 leptin (murine obesity homolog) 2.5
133120 X64559 Hs.65424 tetranectin (plasminogen-binding protein) 2
133501 S95936 Hs.284176 transferrin 2.3
134111 N79674 Hs.8022 TU3A protein 4.6
301395 AA923549 Hs.224121 ESTs 2.1
311794 AW238092 Hs.254759 ESTs 2.1
312575 H25237 Hs.306814 ESTs 2.3
313283 W32480 Hs.157099 ESTs 2.2
322814 AI824495 Hs.211038 ESTs 2.2
322929 AI365585 Hs.146246 ESTs 2.3
324675 AW014734 Hs.157969 ESTs 2.2
330931 F01443 Hs.284256 ESTs 4.6
332364 W94688 Hs.103253 perilipin 2.1
337983 CH22_EM:AC005500.GENSCAN.110-1 2
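
The relationship between Table 15 and its Table 16 subset can be illustrated with a simple filter on R1. This is a sketch under a stated assumption: the text does not give the selection rule, and the cut-off of 2.0 used here is only inferred from the fact that every R1 value listed in Table 16 is 2 or greater. The function name `highly_downregulated` is hypothetical.

```python
# (Pkey, R1) pairs copied from Table 15; R1 is the normal-to-tumor ratio.
table_15 = [
    ("100499", 1.5),
    ("100502", 2.3),
    ("115949", 2.0),
    ("122485", 1.5),
    ("134111", 4.6),
]


def highly_downregulated(rows, cutoff=2.0):
    """Return the Pkeys whose R1 meets the cut-off.

    The default cut-off of 2.0 is an assumption inferred from the values
    printed in Table 16, not a rule stated in the document.
    """
    return [pkey for pkey, r1 in rows if r1 >= cutoff]


selected = highly_downregulated(table_15)
print(selected)
```

With these five rows the filter keeps 100502, 115949 and 134111, all of which appear in Table 16, and drops the two probesets with R1 of 1.5.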
TABLE 16A

Table 16A shows the accession numbers for those pkeys lacking Unigene IDs for Table 16.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

104672 6735 7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386
TABLE 17: Table 1 from BRCA 014 P

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene
identified, a ratio of the relative amount of expression in breast tumors versus normal breast
tissue is shown.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
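
Rows that follow this legend can be split into the five named fields with a regular expression. The sketch below is an illustration only: the pattern `ROW` and the function `parse_row` are hypothetical names, and the pattern assumes that a Unigene ID, when present, has the form `Hs.` followed by digits and that R1 is the last whitespace-separated token on the line.

```python
import re

# Pkey, ExAccn, optional Unigene ID, free-text title, then R1 at end of line.
ROW = re.compile(
    r"^(?P<pkey>\d+)\s+"
    r"(?P<exaccn>\S+)\s+"
    r"(?:(?P<unigene>Hs\.\d+)\s+)?"
    r"(?P<title>.*\S)\s+"
    r"(?P<r1>\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Parse one table row into a dict; 'unigene' is None when absent."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    row = match.groupdict()
    row["r1"] = float(row["r1"])
    return row


# Two rows copied from Table 17, one with and one without a Unigene ID.
with_id = parse_row("100911 X83300 Hs.289103 SMA4 5.2")
without_id = parse_row("103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9")
print(with_id["unigene"], with_id["r1"])
print(without_id["unigene"], without_id["title"])
```

Because titles are free text and may themselves end in a number, the pattern treats only the final token as R1; rows whose ratio lost its decimal point in printing would still parse but would need manual review.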



Pkey ExAccn UnigeneID Unigene Title R1

100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat 3
100405 AW291587 Hs.82733 nidogen 2 3.2
100406 AI962060 Hs.118397 AE-binding protein 1 3.6
100420 D86983 Hs.118893 Melanoma associated gene 3.2
100911 X83300 Hs.289103 SMA4 5.2
100960 J00124 Hs.117729 keratin 14 (epidermolysis bullosa simple 4.3
101011 BE387036 Hs.1211 acid phosphatase 5, tartrate resistant 3
101183 AA442324 Hs.795 H2A histone family, member O 3.2
101194 L20971 Hs.188 phosphodiesterase 4B, cAMP-specific (dun 3
101329 U6G042 Hs.82171 Homo sapiens clone 191B7 placenta expres 4.1
101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3
101474 R07566 Hs.73817 small inducible cytokine A3 (homologous 3.9
101491 M25809 Hs.64173 ATPase, H+ transporting, lysosomal (vacu 4.5
101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9
101602 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 3.4
101663 NM_003528 Hs.2178 H2B histone family, member Q 5.6
101758 BE019494 Hs.79217 pyrroline-5-carboxylate reductase 1 3.6
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 12
101817 M89907 Hs.152292 SWI/SNF related, matrix associated, acti 3.2
101851 BE260964 Hs.82045 midkine (neurite growth-promoting factor 4.1
101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5
102209 NM_002038 Hs.265827 interferon, alpha-inducible protein (clo 3
102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11 3
102297 NM_001504 Hs.198252 G protein-coupled receptor 9 3.7
102299 NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein 3.7
102301 NM_005651 Hs.183671 tryptophan 2,3-dioxygenase 5.2
102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog) 3.5
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 3.9
102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-bind 4
102721 H16646 Hs.118666 hypothetical protein PP591 3.5
102739 AA363025 Hs.155572 Human clone 23801 mRNA sequence 3.2
102791 AF080229 gb:Human endogenous retrovirus K clone 1 3
102804 NM_002318 Hs.83354 lysyl oxidase-like 2 3.2
102903 M73779 Hs.250505 retinoic acid receptor, alpha 3.3
103010 X52509 Hs.161640 tyrosine aminotransferase 12.4
103042 T81656 Hs.252259 ribosomal protein S3 4.5
103117 X63578 Hs.295449 parvalbumin 3
103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9
103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9
103284 AI751601 Hs.8375 TNF receptor-associated factor 4 3.3
103329 X85134 Hs.72984 retinoblastoma-binding protein 5 3.1
103364 X90872 Hs.279929 gp25L2 protein 3
103385 NM_007069 Hs.37189 similar to rat HREV107 3.4
103456 AA496425 Hs.9629 papillary renal cell carcinoma (transloc 3.2
103498 Y09306 Hs.30148 homeodomain-interacting protein kinase 3 3.4
103558 BE616547 Hs.2785 keratin 17 37
103563 L02911 Hs.150402 Activin A receptor, type 1 (ACVR1) 3.2
103612 BE336654 Hs.70937 H3 histone family, member A 4.5
103825 AI571835 Hs.55468 ESTs 4
104073 AW779318 Hs.88417 ESTs 3.8
104103 AW021102 Hs.21509 ESTs 4.3
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7.6
104168 AA461618 Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 3.6
104173 AA084273 Hs.76561 ESTs, Weakly similar to S47072 finger pr 4

104181 AF173296 Hs.283740 DC6 protein 3
104189 AB040927 Hs.301804 KIAA1494 protein 3.2
104269 AI559444 Hs.293960 ESTs 4.3
104307 AI929700 Hs.111680 endosulfine alpha 3.1
104518 H20816 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586I1420 (f 3.2
104556 AV650851 Hs.96900 hypothetical protein; KIAA1830 protein 4.4
104658 AA360954 Hs.27268 Homo sapiens cDNA: FLJ21933 fis, clone H 3.2
104748 AA015879 Hs^3536 ESTs 3.2
104755 T49951 Hs.9029 DKFZP434G032 protein 4.5
104825 AA035613 Hs.141883 ESTs 6.9
104830 AW294092 Hs.21594 hypothetical protein MGC15754 11.1
104865 T79340 Hs.22575 B-cell CLL/lymphoma 6, member B (zinc fi 3.5
104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5
104961 H78517 Hs.33905 ESTs 3.6
105038 AW503733 Hs.9414 KIAA1488 protein 4.5
105088 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL 3.8
105092 AA148982 Hs.29068 ESTs 3
105093 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4.8
105304 AW134924 Hs.190325 ESTs 8.2
105397 AAd14807 Hs.7395 hypothetical protein FLJ23182 3.1
105409 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 4.2
105431 AA252033 Hs.242413 hypothetical protein DKFZp434K1421 4.4
105552 AA256750 Hs^8802 centaurin-alpha 2 protein 3.2
105598 AA279439 Hs.279763 hypothetical protein FLJ10504 3.5
105650 W16741 Hs.25635 HSPC003 protein 3.7
105688 AI299139 Hs.17517 ESTs 5.5
105808 AI133161 Hs.286131 CGI-101 protein 3.5
105809 AW973653 Hs.20104 hypothetical protein FLJ00052 3.3
105909 AA195191 Hs.5111 hypothetical protein FLJ20729 3.2
105965 AA131657 Hs.23830 ESTs 3.3
106135 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3.2
106184 W28948 Hs.10762 ESTs 3.3
106293 N39842 Hs.301444 KIAA1673 4.1
106400 BE397649 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PL 3.1
106474 BE383668 Hs.424d4 hypothetical protein FLJ10618 3.2
106484 AA351978 Hs.4943 hepatocellular carcinoma associated prot 7.8
106533 AL134708 Hs.145998 ESTs 3
106614 AA648459 Hs.335951 hypothetical protein AF301222 3.8
106636 AW958037 Hs.286 ribosomal protein L4 3.3
105661 AW499914 Hs.7579 hypothetical protein FLJ10402 3
106743 BE613328 Hs.21938 hypothetical protein FLJ12492 4.2
106844 AA485055 Hs.158213 sperm associated antigen 6 3.4
106864 AI311928 gb:qo89h04.x1 NCI_CGAP_Kid5 Homo sapiens 4.4
106865 AW192535 Hs.19479 ESTs 3.6
106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1
106942 AA995351 Hs.31314 retinoblastoma-binding protein 7 3.6
106968 AF216751 Hs.26813 CDA14 5.3
107105 AW963419 Hs.155223 stanniocalcin 2 3.4
107158 N32849 Hs.31844 hypothetical protein FLJ12586 3.1
107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 comp 5.9
107265 BE379594 Hs.49135 ESTs, Moderately similar to ALU7_HUMAN A 3.9
107630 AW961576 Hs.60178 ESTs 4.6
107710 AJ955040 Hs.265398 ESTs, Weakly similar to transformation-r 3
107890 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.1
107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564F053 (fr 4.8
108000 AI263307 Hs.239884 H2B histone family, member L 3.3

108217 AA058686 Hs.62588 ESTs 3.8
108435 T82427 Hs.194101 Homo sapiens cDNA: FLJ20869 fis, clone A 3
108591 AB033073 Hs.43857 similar to glucosamine-6-sulfatases 3.3
108733 AA121022 gb:zn84f10.r1 Stratagene lung carcinoma 3.9
108771 AF068290 Hs.79741 hypothetical protein FLJ10116 8.1
108819 AA011449 Hs.271627 ESTs 3.6
108912 AA136674 Hs.118681 EST 3.9
109086 AF186114 Hs.270737 tumor necrosis factor (ligand) superfami 37
109124 AK000684 Hs.183887 hypothetical protein FLJ22104 3.1
109132 AI970536 Hs.16603 hypothetical protein FLJ13163 3.7
109163 N23235 Hs.30567 ESTs, Weakly similar to B34087 hypotheti 4.5
109277 AA196443 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PL 3.7
109410 AW504732 Hs.21275 hypothetical protein FLJ11011 4.6
109454 AA232255 Hs.295232 ESTs, Moderately similar to A46010 X-lin 6.4
109514 AA234087 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.8
109581 R45584 Hs.23025 ESTs, Weakly similar to ALU5_HUMAN ALU S 3.3
109632 AA325138 Hs.235873 hypothetical protein FLJ22672 3
109644 AW973964 Hs.291531 ESTs, Highly similar to 1203217A dehydro 3
109700 F09609 gb:HSC33H092 normalized infant brain cDN 3.2
109768 F06838 Hs.14763 ESTs 3.2
109807 R43646 Hs.12422 ESTs 3.8
109842 AW818436 Hs.23590 solute carrier family 16 (monocarboxylic 3.3
1098^ AK001680 Hs.30488 DKFZP434F091 protein 3.6
110024 AW973152 Hs.31050 ESTs 4.2
110561 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 5.1
110675 H89355 Hs.249159 adrenergic, alpha-2A-, receptor 5.3
110707 AI239832 Hs.15617 ESTs, Weakly similar to ALU4_HUMAN ALU S 3.7
110915 BE092285 Hs.29724 hypothetical protein FLJ13187 3.7
111139 N64683 Hs.290943 ESTs 4
111155 N66563 Hs.191358 ESTs 3.1
111199 AI767435 Hs.29822 ESTs 4.5
111336 AI457338 Hs.29894 ESTs 5.4
111510 R07856 Hs.16355 ESTs 3.2
111532 R08440 gb:yf19f09.s1 Soares fetal liver spleen 3.1
111689 AA602004 Hs.23260 ESTs 3.2
111823 R35252 Hs.24944 ESTs, Weakly similar to 2109260A B cell 3.3
111876 R38239 Hs.293246 ESTs, Weakly similar to putative p150 [H 3.1
111892 AA421081 Hs.12388 ESTs 3.4
111893 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 3.3
112125 AW379029 Hs.118338 ESTs, Weakly similar to unnamed protein 4.4
112170 BE246743 Hs.288529 hypothetical protein FLJ22635 7.3
112287 AB033064 Hs.334806 KIAA1238 protein 3.2
112300 H24334 Hs.26125 ESTs 4.4
112303 R54797 gb:yg87b07.s1 Soares infant brain 1NIB H 3.4
112478 R66067 Hs.28664 ESTs 8.2
112561 AI7gi493 Hs.129873 ESTs, Weakly similar to A36036 cytochrom 5.5
112631 R82040 gb:yj06b06.s1 Soares placenta Nb2HP Homo 3.9
112637 R82331 Hs.164599 ESTs 5.4
112657 AW844878 Hs.19769 hypothetical protein MGC4174 3.2
112678 AI418466 Hs.33665 ESTs 4.7
112917 AA082465 Hs.125031 choline/ethanolaminephosphotransferase 3.7
113070 AB032977 Hs.6298 KIAA1151 protein 3.1
113095 AA828380 Hs.126733 ESTs 3.4
113117 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 3.4
113187 BE613410 Hs.31575 SEC63, endoplasmic reticulum translocon 3.2
113200 T57773 Hs.10263 ESTs 3.5
113206 BE262470 Hs.241471 RNB6 6.2
113374 T79925 Hs.269165 ESTs, Weakly similar to ALU1_HUMAN ALU S 37
113440 U54727 Hs.191445 ESTs 3
113494 T91451 Hs.86538 ESTs 3.4
113518 AW367788 Hs.323954 postmeiotic segregation increased 2-like 3.1
113571 AI702609 Hs.15713 hypothetical protein MGC2776 3.1
113822 NM_004585 Hs.17466 retinoic acid receptor responder (tazaro 3.9
113835 AI912410 Hs.27475 Homo sapiens cDNA FLJ12749 fis, clone NT 3
113938 W81598 gb:zd88g02.s1 Soares_fetal_heart_NbHH19W 4.6
113947 W84768 gb:zh53d03.s1 Soares_fetal_liver_spleen_ 3.1
113970 W27249 Hs.8109 hypothetical protein FLJ21080 6.9

114086 AA378776 Hs.288649 hypothetical protein MGC3077 4.3
114148 AW470411 Hs.28a433 neurotrimin 4.1
114424 AW780192 Hs.267596 ESTs 3.4
114518 AW163267 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like 3.1
114563 AI97916a Hs.82226 glycoprotein (transmembrane) nmb 4.8
114965 AI733881 Hs.72472 BMP-RIB 10.1
114995 AA769266 Hs.193657 ESTs 3.6
115121 AI634549 Hs.88155 ESTs 3.2
115134 AW968073 Hs.194331 ESTs, Highly similar to A55713 inositol 4.2
115167 AA749209 Hs.43728 hypothetical protein 3
115253 BE149845 Hs.289038 hypothetical protein MGC4126 3.6
115277 AA814100 Hs.86693 ESTs 3.9
115327 N46436 Hs.109221 ESTs 3.4
115354 AA281636 Hs.334827 ESTs 4.8
115657 AA405620 Hs.55158 ESTs, Weakly similar to T29520 hypotheti 35
115676 AA953006 Hs.88143 ESTs 9.3
115709 AW293849 Hs.58279 ESTs, Weakly similar to ALU7_HUMAN ALU S 3.4
115729 AA417812 Hs.38775 ESTs 4
115787 AI126772 Hs.40479 ESTs 3.1
115830 AW970529 Hs.86434 hypothetical protein FLJ21816 3.6
115836 AA521410 Hs.41371 ESTs 3.1
115850 NM_014937 Hs.52463 KIAA0966 protein 3
115900 AK001500 Hs.165186 hypothetical protein FLJ13852 3.2
115935 AA354M9 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3
115948 AL042465 Hs.43445 poly(A)-specific ribonuclease (deadenyla 3.1
116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 ( 6.7
116115 AL042355 Hs.70202 WD repeat domain 10 16
116184 AW450737 Hs.128791 CGI-09 protein 3.1
116192 AA454976 Hs.62528 ESTs, Moderately similar to A46010 X-lin 3.3
116208 AI219083 Hs.42532 ESTs, Moderately similar to ALU8_HUMAN A 3.2
116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 3.6
116443 AW962196 Hs.321264 LBP protein 32 4.1
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.1
116726 AK001114 Hs.53913 hypothetical protein FLJ10252 8.6
116845 AA649530 gbjis44l05.s1 NCLCGAP^I Homo sapiens 3.2
117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75_HUMAN Z 3.5
117216 AI569804 Hs.42792 ESTs, Weakly similar to I78885 serine/th 3.1
117296 AL133427 Hs.42506 Homo sapiens mRNA full length insert cDN 3.2
117403 H84455 Hs.40639 ESTs 47
117691 AB040959 Hs.93836 DKFZP434N014 protein 3
118229 AW968941 Hs.166254 hypothetical protein DKFZp566I133 3.3
118363 AI183838 Hs.48938 hypothetical protein FLJ21802 4.3
118416 N66028 Hs.49105 FKBP-associated protein 3.1
118470 AW970584 Hs.291033 ESTs 3.4
118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 5.2
118695 AK000465 Hs.50081 KIAA1199 protein 3.4
118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 3.3
119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 19.7
119036 R95872 Hs.117572 chemokine binding protein 2 37
119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1_HUMAN A 4.1
119075 M10905 Hs.287820 fibronectin 1 3.2
119620 W47620 Hs.56009 2'-5'-oligoadenylate synthetase 3 (100 k 3.3
119741 AF041853 Hs.43670 kinesin family member 3A 3.1
119747 AI970797 Hs.64859 ESTs 5
119754 AL037824 Hs.194695 ras homolog gene family, member I 3.8
119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 3.1
120084 W94472 Hs.59529 ESTs, Moderately similar to ALU1_HUMAN A 8.4
120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 3.6
120326 AA195300 Hs.21145 hypothetical protein RG083M05.2 3.2
120742 AA225084 gb:nc21d06.r1 NCI_CGAP_Pr1 Homo sapiens 3.6
120870 AA357172 Hs.292581 ESTs, Moderately similar to ALU1_HUMAN A 5.8
120885 AA365515 Hs.301872 hypothetical protein MGC4840 3
120970 AA398118 Hs.97579 ESTs, Weakly similar to A46010 X-linked 37
121054 AW976570 Hs.97387 ESTs 5.3
121095 AA320134 Hs.198029 Homo sapiens mRNA for KIAA1657 protein. 4
121103 AA398936 Hs.97697 EST 3.5
121121 AA399371 Hs.189095 similar to SALL1 (sal (Drosophila)-like 6.3

121337 AW885727 Hs.301570 ESTs 4.7 
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121351 AW206227 Hs^87727 hypothetical proten FU23132 5 

121611 M31669 Hs.1735 inhtbin. beta B (activin AB beta polypep 3.6 

121643 AA640987 Hs.193767 ESTs 5.6 

121770 NIVLO15902 Hs.278428 progestin induced protein 3.4 

5 122125 AKO00492 Hs.98806 hypothetical protein 4.1 

122338 AA443311 Hs.98998 ESTs 3 

122417 AA446965 Hs.1l2092 ESTs 4.7 

122513 AI767879 Hs.99214 ESTs 3.8 

122544 AW973253 Hs.292689 ESTs 3 

10 122655 AA323296 Hs.97837 Homo sapiens mRNA; cDNA DKF2p547J047 (fr 5.6 

122805 AA526911 Hs.82772 collagen, type XI, alpha 1 3.2 

122851 AW205931 Hs.99598 hypothetical protein MGC5338 8.6 

123105 AA487809 Hs.166011 catenin (cadherin-assodatsd protefn), d 3 

123111 AA228776 Hs.191721 ESTs 6.9 

IS 123249 AA371307 Hs.125056 ESTs . 3.6 

123273 AA49ia3 Hs.173611 Empirically selected from AFFX single pr 7 

123385 BE149685 Hs.17767 KIAA1 554 protein 3.1 

123419 T66087 Hs.1 12482 Homo sapiens unknown mRNA sequence 3,4 

123485 AI308876 Hs. 103849 hypothetical protein DKFZp761D1 12 3.1 

20 123645 A1675944 Hs.188691 Homo sapiens cDNAFU12033fis, done HE 3.8 

123819 AA580082 Hs.1 12264 ESTs 4.7 

124012 AA352723 Hs.241471 RNB6 3.8 

124243 H69125 Hs,133525 ESTs 4.1 

124357 N22401 gb7w37g07. si Morton Fetal Cochlea Homo 4.1 

25 124359 N22508 Hs.139315 Homo sapiens cDNA: FU21479 fis. done C 3.6 

124557 AW451645 Hs.151504 Horn o sapiens cDNAFU 11 973 fis, done HE 3.1 

124911 N34151 Hs.174195 interferon induced transmembrane protein 3.5 

124972 R41396 Hs.101774 hypothetical protein FU23045 4.3 

125006 BE065138 Hs.145696 splidng factor (CC1.3) 6 

30 125042 T789C6 Hs.269432 ESTs, Modeiately similar to ALU IJiUIVIAN A 8.1 

125184 W60326 Hs.288684 Homo sapiens cDNAFU11750 fis, done HE 4.7 

125243 AW970536 Hs.105413 ESTs 3.1 

125286 AF086534 Hs.187561 ESTs. Moderately similar to ALUIJiUMAN A 3.3 

125304 AU59573 Hs.1 24940 GTP-binding protein 3 

35 125330 AW880562 Hs.1 14574 ESTs 3 

125331 A(422996 Hs,161378 ESTs 3.2 

125685 AI924630 Hs.4943 hepatocellular cardnomaassodatedprot 3.2 

126257 N99638 gb:za39g11.r1 Scares fetal liver spleen 4 

126474 AW975814 Hs.326714 Homo sapiens done IMAGE:713177, mRNA se 4 

40 126666 AA648886 Hs.151999 ESTs 3.8 

126872 AW450979 gb:UI-H-BI3-ala^-12-0-Ul.s1 NCLCGAP.Su 3 

127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259fem* 3.6 

127980 AA961459 Hs.125644 ESTs 4.1 

127997 AW068311 Hs.311054 Homo sapiens mRNA ^1 length insertcDN 3.3 

45 128420 AA650274 Hs.41298 fibronectin leudne rich transmembrane p 4.6 

128809 NM.003616 Hs.102456 survival of nootorneuron protein interac 3.9 

128946 Y13153 Hs.107318 kynurenine S-monooxygenase (kynurantne 3 3.1 

128955 AA775076 Hs.185807 Homo sapiens, Similar to PRO0478 protein 3.9 

129092 D56365 Hs.63525 poly(rC)-binding protein 2 3.3 

129270 AA357185 Hs.109918 ras homolog gene family, member H 3.1

129301 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9

129385 AA172106 Hs.110950 Rag C protein 6.2

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.4

129629 AK000398 Hs.11747 hypothetical protein FLJ20391 3

129725 X56411 Hs.1219 alcohol dehydrogenase 4 (class II), pi p 3.2

130069 AI754813 Hs.146428 collagen, type V, alpha 1 5.4

130092 X03363 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 4.4

130298 AI347487 Hs.132781 class I cytokine receptor 4.6

130382 NM_003450 Hs.155204 zinc finger protein 174 5.6

130622 AI582291 Hs.16846 ESTs, Weakly similar to 04HUD1 debrisoqu 3

130703 R77776 Hs.18103 ESTs 3.8

130881 AA809875 Hs.25933 ESTs 4.2 

130954 AB014544 Hs.21572 KIAA0644 gene product 4.7 

131095 AI399653 Hs.22917 ESTs 4.3

131153 H09048 Hs.23605 ESTs 3.8

131253 R71802 Hs.24853 ESTs 3.5

131372 AW293399 Hs.144904 nuclear receptor co-repressor 1 3.6
131507 AI826268 Hs.277e9 ESTs, Weakly similar to MCAT_HUMAN MITOC 3.2

131587 AI695549 Hs.183868 glucuronidase, beta 3.1

131739 AF017986 Hs.31386 secreted frizzled-related protein 2 3.2

1317^ BE501849 Hs.32317 high-mobility group 20B 3.2

131970 D86960 Hs.3610 KIAA0205 gene product 3.6

131986 NM_002314 Hs.36566 LIM domain kinase 1 3.2

132093 AA400091 Hs.39421 ESTs 3.2 

132122 AA426202 Hs.40403 Cbp/p300-interacting transactivator, wit 3

132159 D76435 Hs.41154 Zic family member 1 (odd-paired Drosophil 3.2

132333 AA192669 Hs.45032 ESTs 35

132406 AL133731 Hs.4774 Homo sapiens mRNA: cDNA DKFZp761C1712 (f 92 

132482 AV560345 Hs.238126 CGM9 protein 8.2

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 3.4 

132624 AA326108 Hs.33829 bHLH protein DEC2 3.2 

132700 AA319233 Hs.5521 ESTs 4.8

132725 NM_006276 Hs.184167 splicing factor, arginine/serine-rich 7 3.6

132799 W^ll Hs.169407 SAC2 (suppressor of actin mutations 2, y 3.2

132847 T48195 Hs.58189 eukaryotic translation initiation factor 3.5

132857 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 4.4

132936 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 4.8

133130 AI128606 Hs.6557 zinc finger protein 161 3.3 

133142 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex- 3.5 

133167 AW162840 Hs.6M1 kinesin family member 5C 4.5 

133225 AW600291 Hs.6823 hypothetical protein FLJ10430 33

133274 AA085191 Hs.6949 hypothetical protein MGC11275 3

133275 Z93241 Hs.239934 CGI-96 protein 4.5

133287 AW797437 Hs.69771 B-factor, properdin 4.1

133376 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 5.1

133462 AW675064 Hs.73875 fumarylacetoacetate hydrolase (fumarylac 3

133740 AW162919 Hs.170160 RAB2, member RAS oncogene family-like 3.4

133831 BE274552 Hs.76578 protein inhibitor of activated STAT3 3.9

133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 6.2

134666 BE391929 Hs.8752 transmembrane protein 4 3.1

134710 AI433797 Hs.8889 serine hydroxymethyltransferase 1 (solub 3

134731 D89377 Hs.89404 msh (Drosophila) homeo box homolog 2 5.8

134776 J05582 Hs.89603 mucin 1, transmembrane 4

135230 AF064804 Hs.96757 suppressor of Ty (S. cerevisiae) 3 homolo 3.2

135303 R61253 Hs.98265 KIAA1877 protein 3.3

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 4.8

135411 L10333 Hs.99947 reticulon 1 3.8

300089 AI199738 Hs.208275 ESTs, Weakly similar to ALUA_HUMAN !!!! 3.8

300233 AW514220 Hs.189402 ESTs 4.2

300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9

300256 AW591433 Hs.298241 Transmembrane protease, serine 3 4.9

300378 Z45270 Hs.235873 hypothetical protein FLJ22672 3.4

300973 AA572949 Hs.207566 ESTs 3.5

301111 R10799 Hs.191990 ESTs 3.8

301341 AA887801 Hs.208229 G protein-coupled receptor 13.9

301548 AI091631 Hs.203845 two pore potassium channel KT3.3 4.4

301884 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7

301936 U79745 Hs.114924 solute carrier family 16 (monocarboxylic 8.6

301976 T97905 gb:ye54c10.r1 Soares fetal liver spleen 3.9

302001 AB020711 Hs.278346 KIAA0904 protein 7.7 

302067 BE542706 Hs.222399 CEGP1 protein 7.3 

302094 AW749321 Hs.6786 ESTs 3.3

302099 AL049670 Hs.137576 ribosomal protein L34 pseudogene 1 4.2

302145 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 7.9

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112 (fr 5.6

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.4

302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1

302372 AL117408 Hs.200102 ATP-binding cassette transporter MRP8 6.7

302378 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 4 

302384 AI678059 H$^02676 synaptonemal complex protein 2 4.3 

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.8

302680 AW192334 Hs.38218 ESTs 9.6

302830 AI038997 Hs.132921 ESTs 5

302857 AF282265 Hs.44836 inner centromere protein antigens (135kD 3.4

302892 AW176909 Hs.42346 calcineurin-binding protein calsarcin-1 3.4

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019 dynein he 5.1

303271 AA652687 Hs.96151 Human DNA sequence from clone RP5-1103G7 3.7

303289 AL121460 Hs.272673 hypothetical protein FLJ20508 4.1

303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypotheti 4.2

303540 AA355607 Hs.309490 ESTs, Weakly similar to putative WHSC1 p 4.3

303563 AA367699 Hs.10082 potassium intermediate/small conductance 3.3

303642 AW299459 gb:xs50d08.x1 NCI_CGAP_Kid11 Homo sapien 4.2

303780 AI424014 Hs.18995 KIAA1304 protein 3.6

303797 AW629759 gb:hh70e05.y1 NCI_CGAP_GU1 Homo sapiens 4.9

303852 R53434 Hs.90207 hypothetical protein MGC11138 3.7

304328 AA149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCLCGAP.GasI Homo sapiens 4.1 

305913 AA876109 gb^ix24h0 1. si NCLCGAP.GC3 Homo sapiens 3 

IS 305917 AA876469 gb:oe48b04.s1NCLCGAP.Pi25 Homo sapiens 3.1 

307010 AI140014 9b:qa68f09.x1SoaresLfetaLhearLNbHHigW . 3.5 

307041 AI144243 gb:qb85b12j(1 SoaresLfetaLheaiLNbHH19W 3.9 

308106 AI476803 gb:577e12.x1 Soares_NSF_F83W^OT_PA_P^S 4.3 

308307 AI581398 Hs.172928 collagen, type I, alpha 1 4.6

308615 AK000142 Hs.101774 hypothetical protein FLJ23045 4.4

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3

309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 3.2

309574 AW168083 gb:xg59g04.x1 NCI_CGAP_Ut4 Homo sapiens 3.1

309583 AW170036 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6

310064 AI199712 Hs.148486 ESTs, Weakly similar to 1917210A Pro/Arg 4.6

310098 AI685841 Hs.161354 ESTs 3.6

310438 AW022192 Hs.200197 ESTs 4.6

310683 AI939456 Hs.160870 ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 3.6 

30 310781 AI380797 Hs.158992 ESTs 10.2 

■ 310895 AI955121 Hs.165724 N-acetylgaiactosamine^O-^ulfotransfera ZA 

310955 AI476732 Hs.263912 ESTs 10.9 

311117 AI671439 Hs,196029 Homo sapiens mRNA for KIAA1657 protein, 3.1 

311166 AI821005 Hs.118599 ESTs 10.8 

311237 AA641098 Hs.208809 ESTs, Moderately similar to ALU1_HUMAN A 4.3

311455 AI758660 Hs.206132 ESTs 4.4

311587 A1828254 Hsi71019 ESTs, Weakly sltrflar to A47582&cengr 5.1 

311598 AW023595 Hs.232048 ESTs 5.8 

311774 AA700870 Hs.14304 ESTs 3.3 

40 311785 A1056769 Hs.1 33512 ESTs 3.9 

311872 R12375 Hs.194600 ESTs 3.3 

311889 AA767342 Hs.122483 ESTs, Weakly similar to PSF.HU MAN PTB-AS 3 

311913 AI358522 Hs.270188 ESTs 3 

311923 T60843 ' H5.189679 ESTs 5.6 

45 311935 AA216387 gbmc16b025l NCIjCGAP.Prl Homo sapiens 5.2 

312019 AA373630 H8,188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs,14411 ESTs 3.5 

312090 T&0177 Hs.118064 similar to rat nuclear ubiquitous casein 3.8

312147 AI633744 Hs.195648 ESTs, Weakly similar to 138022 hypotheti 4.4

312153 BE261944 Hs.118625 hexokinase 1 5.2

312168 T92251 Hs.198882 ESTs 3.3 

312182 T94344 Hs.326263 ESTs 3.3 

312187 AA700439 Hs.188490 ESTs 3.4 

55 312199 AW438602 Hs.191179 ESTs 3.9 

312219 H73505 Hs.1 17874 ESTs 4 

312226 AA315703 Hs,199993 ESTs, Weakly similar to ALUB.HUMAN III! 4.9 

312299 AA972712 Hs.269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs. Weakly similar to 138022 hypotheti 6.3 

60 312638 AW439195 Hs256880 ESTs. Weakly similar to S65657alpha-1C. 4.9 

312826 AW291545 Hs.185018 ESTs 4:9 

312837 AW292286 Hs.255058 ESTs 4.4 

312980 AA497043 Hs.1 15685 ESTs 3.1 

313070 AI422023 Hs,161338 ESTs 4.3 

313079 N76497 Hs.1787 proteolipid protein 1 (Pelizaeus-Merzbac 3.3

313089 AF026944 Hs.293797 ESTs 5.8

313096 AW073310 Hs.163533 Homo sapiens cDNA FLJ14142 fis, clone MA 4.5

313126 AA746503 Hs.283313 ESTs 10

313166 AI801098 Hs.151500 ESTs 3.5 

313197 AW979008 Hs.222487 ESTs 3.3 

313280 AW960454 Hs.222830 ESTs 47 

5 313325 AI420611 Hs.127832 ESTs 3.4 

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4

313352 AW150945 Hs.144758 ESTs 4.1 

313385 AI032087 Hs.259819 ESTs • 3 

313393 A(874685 Hs.200141 ESTs 5.2 

10 313417 AA741151 Hs.137323 ESTs 3.5 

313434 W92070 gb2h48g05.r1 Soares_fetaLliver_spleen_ 3.7 

313569 AI273419 Hs.135146 hypothetical protein FLJ13984 3

313591 AA046309 gb2f12K)1.s1 SoaresjBtalJiearLNbHH19W 5.6

313615 AI540978 Hs.301997 hypothetical protein FLJ13033 3.2

313915 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 26.3

313975 AW175896 Hs.65114 keratin 18 3

313979 A1535895 Hs.221024 ESTs 4.9 

313997 AV657317 Hs.288649 hypothetical protein MGC3077 3.9 

314043 AA827082 Hs.291872 ESTs 3.1 

20 314078 AW129357 Hs.329700 ESTs 8.3 

314097 AA648744 Hs.269493 ESTs 6.6 

314121 AI732083 Hs.187619 ESTs 6.2 

314129 AA228366 Hs.115122 ESTs 4 

314138 AA740616 gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9

25 314236 AA743396 Hs.189023 ESTs 3.1 

314244 AL036450 Hs.103238 ESTs 4 

314305 AI280112 Hs.125232 Homo sapiens cDNA FLJ13266 fis, clone OV 8

314306 A1697901 Hs.192425 ESTs 3.7 
314322 AAg07l53 Hs.190060 ESTs 3.3 

30 314394 AW961597 Hs.130816 ESTs, Moderately similar to 138022 hypot 4.2 

314401 AI660412 Hs.234557 ESTs 3.3 

314465 AA602917 Hs.156974 ESTs 4.7 

314506 AA833655 Hs.206868 Homo sapiens cDNA FLJ14056 fis, clone HE 8.5

314510 AI204418 Hs.190080 ESTs 4

314546 AW007211 Hs.16131 hypothetical protein FLJ12876 3.4

314547 AA399272 Hs.144341 ESTs 6.7

314558 AI873274 Hs.190721 ESTs 27.4 

314627 AA425310 Hs.155766 ESTs, Weakly simitar to A47582&<:ettgr 4.4 

314648 AW979268 gb:EST391378 MAGE resequences. MA6P Homo 4.6 

40 314691 AW207206 Hs.136319 ESTs 20.7 

314729 AA457367 Hs.191638 ESTs 3.6 

314754 AW026761 Hs.134374 ESTs 3.6 

314814 BE350122 Hs.157367 ESTs, Weakly simitar to 178885 serinefth 4.9 

314864 AW971198 Hs.294068 ESTs 4.3 

45 314881 AI095087 Hs.1 52299 ESTs, Moderately similar to S65657 alpha 3.7 

314882 AA828032 Hs.189076 ESTs 3.1 

314981 AW972359 Hs.293334 ESTs 3 

315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9

315021 AA533447 Hs.312989 ESTs 5.3 

50 315051 AW292425 Hs.163484 ESTs 12.9 

315060 AA551104 Hs.189048 ESTs. Moderately similar to ALUC_HUMAN I 5.8 

315073 AW452948 Hs.257631 ESTs 4.2 

315080 AA744550 Hs.136345 ESTs 3.7 

315175 AI025842 Hs.152530 ESTs 6 

55 315183 AW136134 Hs.220277 ESTs 3.9 

315193 AI241331 Hs.131765 ESTs, Moderately similar to 138937 DNA/R 4.4

315196 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.2

315198 AI741506 Hs.165900 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.6

315240 R38772 Hs.172619 myelin transcription factor 1-like 3.4

60 315263 AW510994 Hs.220740 ESTs 3.4 

315282 AI222165 Hs.144923 ESTs 4.9 

315296 AA876905 Hs.125286 ESTs 4 

315368 AB037745 Hs.104696 KIAA1324 protein 4.7

315397 AA218940 Hs.137516 fidgetin-like 1 3.1

65 315489 AI378817 Hs.191847 ESTs 3.1 

315498 AA628539 Hs.116252 ESTs. Moderately similar to ALULHUMAI^ A 3.2 

315526 AI193043 Hs.128685 ESTs, Weakly similar to T17226hypotheti 4.1 

315530 AW015415 Hs.127780 ESTs 8.9

315562 AA737415 Hs.152826 ESTs 5.5 

315634 AA837085 Hs.220585 ESTs 6.3 

315647 AA648983 Hs.212911 ESTs 3.6

315707 AI418055 Hs.161160 ESTs 5.1

315772 AW515373 Hs.271249 Homo sapiens cDNA FLJ13580 fis, clone PL 3.1

315850 AW270550 Hs.116957 ESTs 3.8

315858 AA737345 Hs.294041 ESTs 5 

315878 AA683336 Hs.189046 ESTs 3.1 

10 315977 AW865916 Hs.151206 ESTs 47 

315978 AA830893 Hs.1 19769 ESTs 4.1 

315995 A(217477 Hs.194591 ESTs 4.1 

316012 AA764950 Hs.119898 ESTs 7 

316042 A1469960 Hs.170698 ESTs 4.9 

15 316052 AI962796 Hs. 136754 ESTs 4.1 

316072 AW517524 Hs.135201 N0D2 protein 3.2 

316074 AW975114 Hs.293273 ESTs 3.8 

316100 AW203986 Hs.213003 ESTs 3.2 

316133 AI187742 Hs,125562 ESTs 3.7 

20 316177 AI904982 Hs.293102 ESTs. Moderately similar to ALUI^HUMAN A 30.7 

316186 AI433540 gb:ti69g05.x1 NCI_CGAP_Kid11 Homosaplen 3.1 

316244 AI640761 Hs.224988 ESTs 3.5 

316303 AA740994 Hs.209609 ESTs 3,8 

316313 AA741300 Hs.202599 ESTs. Weakly slmSar to 138022 hypotheti 4.4 

25 316364 AA747807 Hs.149500 ESTs 3.2 

316560 AA938198 Hs.146123 poly(A) polymerase gamma 9.4

316697 AW293174 Hs.252627 ESTs 4.4 

316715 A(440266 Hs.170673 ESTs. Weakly similar to T24832hypotheB 3 

316868 AI660898 Hs.195602 ESTs 3.2 

30 316869 AI954880 Hs.134604 ESTs 3.2 

316886 AA836331 Hs.134981 ESTs 4.4 

316897 AA838114 Hs.221612 ESTs 3.7 

316943 AW014875 Hs.137007 ESTs 4.6 

317069 A(732892 Hs.190489 ESTs 5.9 

35 ' 317194 AW445167 Hs.126036 ESTs 4.1 

317360 AI125252 Hs.126419 ESTs 3.5 

317404 AI806867 Hs.1 26594 ESTs 5.1 

317452 AA972965 Hs.135568 ESTs 6.9 

317501 AI822034 Hs.137097 ESTs 4.6 

40 317674 AW294909 Hs.132208 ESTs 4.3 

317803 AW664964 Hs.128899 ESTs 6.1 

317834 X56348 Hs.287270 ret proto-oncogene (multiple endocrine n 3.1 

317850 A1581545 Hs.152982 hypothetical protein aj13117 3.4 

317881 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 9.6

45 317902 AW102941 Hs,211265 ESTs 4.1 

317916 AI565071 Hs.159983 ESTs 10.3 

318042 AW294522 Hs.149991 ESTs 3.1 

318223 AI077540 Hs.134090 ESTs 3.9 

318327 AW294013 Hs.200942 ESTs 3 

318332 AI093930 Hs.163440 Homo sapiens cDNA: FLJ21000 fis, clone C 4.4

318418 AF107493 Hs.118498 Homo sapiens LUCA.15 protein mRNA, splic 5.4

318558 AW402677 Hs.146381 RNA binding motif protein, X chromosome 4.4

318625 AA526235 Hs.193162 Homo sapiens cDNA FLJ119B3 fis, clone HE 5.9

318634 T49598 Hs.156832 ESTs 4

318740 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 7.3

318744 A1793124 Hs.144479 ESTs 17.8 

318781 F11802 Hs.6818 ESTs 3 

319191 NM_012391 Hs.79414 prostate epithelium-specific Ets transcr 3.6

319478 AI524124 Hs.270307 ESTs 4.6 

60 319510 W88532 Hs,254562 ESTs 3.3 
319551 AA761668 gb:nz24o08.s1 NCLCGAPJsCBI Homo sapiens 3.2 

319745 T79366 Hs.108258 actin binding prolein; macrophm (mioof 3.3 

319834 AA071267 gb:zm61g01j1 Stralagene fibroblast (937 6.2 

319840 C19035 Hs.164259 ESTs 3.3 

65 319977 AA534222 gbaii21dOZs1 NCLCGAPJW1 Homo sapiens 4.3 

320074 AA321166 Hs,278233 ESTs . 3.4 

320167 AA984373 Hs.90790 Homo sapiens cDNA: FU22930 fis. done K 4.1 

320187 T99949 Hs.303428 Homo sapiens cDNA FLJ14832 fis, clone OV 5.3

320211 AL039402 Hs.125783 DEME-6 protein 9.2

320416 AIQ26984 Hs.2g3662 ESTs 3.1

320588 U78082 Hs.167738 RNA polymerase II transcriptional regula 3.1

320635 N50617 Hs.80506 small nuclear ribonucleoprotein polypept 6.1

320654 AI160015 Hs.118112 ESTs 3.5 

320742 AI601188 Hs.120910 ESTs 3 

320832 AA214584 Hs.290i67 ESTs 37 

320915 AI359144 Hs.143688 Homo sapiens cDNA: FLJ23031 fis, done L 3.1 

10 321016 BE144167 Hs.49994 hypothetical protein similar to RNA-bind 3^ 

321107 AI732643 Hs.144151 ESTs 12.3 

321171 AI769410 Hs.221461 ESTs 3.3 

321253 AA610649 Hs.333239 ESTs 3 

321318 AB033041 Hs.137507 vang (van gogh, Drosophila)-like 2 3.9

15 321642 AI432199 Hs^47084 ESTs 3 

321644 AW975944 Hs^37396 ESTs 11.7 

321683 A)471598 Hs.197531 ESTs 3.8 

321758 U29112 Hs.196151 ESTs 4.4 

321811 D80630 gb:HUM091D02B Human fetal brain (TFujlwa 3.2 

20 321828 R59890 Hs.83623 nuclear receptor subfamily 1, group I, m 3.1 

321910 H67055 Hs.271530 ESTs. Weakly similar to ALU7J^UMAN ALUS 4.7 

321937 AL049351 Hs.302058 Homo sapiens mRNA; cDNA DKFZp566C093 (fr 3.5 

321978 N77342 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT 5

322035 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 19

25 322136 AF075083 gb:Homo sapiens fulllength insert cDNA 3.6 

322258 BE265745 Hs.194359 ESTs, Weakly similar to ALUC_HUMAN 111! 3 

322296 W76326 gb:zd50d04.r1 SoaresJetalJeartJibHHIOW 4.4 

322303 AI357412 Hs.157601 ESTs 11.5 

322476 AW963372 Hs.46677 PRO2000 protein 3 

30 322520 T55956 gb:yb35t05.r1 Stratagene fetal spleen (9 3 

322521 AF147347 gb:Homo sapiens full length Insert cDNA 4.2 

322567 AF155108 Hs.256150 Homo sapiens, Sinular to RiKENcDNA 2810 4 

322595 W92147 H&1 18394 ESTs 5.4 

322675 AA017656 gb:ze39h01 j1 Scares retina N2b4HR Homo 3.1 

322766 AW068805 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.2

322818 AW043782 Hs.293616 ESTs 7.6

322882 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, clone HE 5.9

322975 C16391 gb:C16391 Clontech human aorta polyA mRN 16.5

323091 AI902456 Hs.210761 ESTs, Weakly similar to 138022 hypotheti 4

323131 AK002088 Hs.270124 Homo sapiens cDNA FLJ11226 fis, clone PL 3.3

323168 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 6.3

323244 AW675572 Hs. 193620 ESTs 4.6 

323262 AL133990 Hs.190642 ESTs 10.5 

323332 AI829520 gb:wl19c06.x1 NCI_CGAP_Ut1 Homo sapiens 6.2

45 323333 AV651680 HS.2085S8 ESTs 4.3 

323335 AI655499 Hs.161712 ESTs 9^ 

323645 AW445014 H5.ig7746 ESTs 3.1 

323663 BE081058 Hs.243023 ESTs 4 

323693 AA317962 Hs.249721 ESTs. I^derately similar to PC4259 ferri 3 

50 323782 AW961560 Hs.97600 ESTs 3.2 

323817 AA410943 BMP-RIB 8.4 

323930 AL043683 Hs.8173 hypothetical protein FLJ10803 3.3 

323974 AI825204 Hs.211408 ESTs 4.5 

324001 AL044949 Hs.116298 ESTs 4.5 

55 324036 AI472078 Hs.303662 ESTs 8.4 

324261 BE059341 gb:QV3-BT0381-270100-073-c08 BT0381 Homo 49.4

324285 AA431159 Hs.122954 ESTs 3 

324296 AI524039 Hs.1 92524 ESTs 3 

324305 AA642007 Hs.116369 ESTs 3.3 

60 324432 AA464510 Hs.152812 ESTs 16.5 

324585 AI823969 Hs.132678 ESTs 3.3 

324598 AW972227 Hs.163986 Homo sapiens cDNA: FU22765 fis. ctone K 5 

324603 AW993522 Hs.292934 ESTs 10.4 

324631 AA937116 Hs.293683 ESTs, Weakly similar to 154374 gene NF2 3.3 

65 324716 BE169746 Hs,12504 likely ortholog of mouse Arkadia 3.2 

324748 AW974941 Hs.292386 ESTs, Weakly similar to 178885 serine/th 3 

324771 AA631739 Hs.335440 EST 3 

324774 AI031771 Hs.132586 ESTs 4.2

324823 AW516704 Hs.208726 ESTs 3.4 

324824 AI826999 Hs.224624 ESTs 3.1 
324826 AA704806 Hs.143842 ESTs. Weakly similar to 2004399AchR)mos 4.4 

5 324961 AA613792 gb:no97h03.s1 Na_CGAPJ>r2 Homo sapiens 3.9 

324987 AI375572 Hs.172634 ESTs 18.8 

324994 Ai805416 Hs.213897 ESTs 3.3 

325146 AI064690 Hs.171176 ESTs 4.2 

325372 Phase2&3ExDns 4,4 

10 325544 Phase2&3Exons 5.7 

327075 Phase2&3Exons 3.8 

332798 C22000007:gi|12314195|emb|CAB99338.1| (A 4.3 

334223 NM_005080*:Homo sapiens X-box binding pr 26.2

334447 NM_012429*:Homo sapiens SEC14 (S. cerevi 3.9

335809 NM_014509*:Homo sapiens kraken-like (BK1 10.1

335824 ENSP00000249072*:DJ222E13.1 (N-TERMINAL 20

338255 NM_014323*:Homo sapiens zinc finger prot 9

409430 R21945 Hs.166975 splicing factor, arginine/serine-rich 5 4

428046 AW812795 Hs.155381 ESTs, Moderately similar to 138022 hypot 4,6 

20 432558 R97268 Hs.177269 ESTs 3.2 

436808 AA731602 Hs,120266 ESTs 3.9 

448569 BE382657 Hs.21486 signal transducer and activator of trans 4.1 

453542 AW836724 Hs.339660 Homo sapiens mRNA expressed only in plac 3,7 

M97935 AFFX control: STAT1 3.2

M97935 AFFX control: STAT1 3

M55150 fumarylacetoacetate 3

M13755 interferon stimulated protein; 15 kDa 4.5

AI052047 ESTs 6,7 

AA252033 ESTs; Weakly simitar to ALU SUBFAMILY J 3.2 

30 AA401739 ESTs 3.3 

H18459 hepatocellular carcinoma associated protein; 3

R48744 ESTs 4.2

M31682 inhibin; beta B (activin AB beta polypeptide) 3

AA416873 ESTs 3

D80240 HUM5G11A Human fetal brain (TFujiwara) Homo 4

R49590 ESTs 3.2 

CH22_FGENES.678_5 16.8 

CH22_FGEN£S.619_7 12.9 

CH22_FGENES.619_12 11,3 

40 CH22_EM:AC005500.GENSCAN,127 9 9.2 

CH22 EM:AC005500.6ENSCAN.3042 8,5 

CH22_FGENES.271_8 8.4 

CH22 FGENES.619_13 8 

CH22_FGENES.271_7 7.3 

45 CH22_FGENES.617_7 7.2 

CH,07_hsgi|6004473 7.1 

CH22_FGENES.264_1 6.8 
X03363 HER2 receptor tyrosine kinase (c erbB 2; ERBB2: neu) 6.6 

CH22.FGENES.617_9 6.5 

50 CH.07 hsgi|5868264 5.8 

CH.19_hsgi|5867439 5.7 

CH22_FGENES.63 " 5.3 

CH.17_hsgl|5867230 - 5.1 

CH.20 hsgll6552458 5.1 

55 CH22_EM:AC005500.GENSCAN.148 22 4,7 

CH22_FGENES.669_10 4.6 

AA034918 KIAA1028 protein 4.6 

CH22^FGENES.48J2 4.5 

CH22_FGENES.118_2 4,5 

60 AF049569 ESTs 4,4 

M13955 multiple UniGenematdies 4.3 

CH22.FGENES.619_8 4.3 

CH2^GENES.137 4.3 

HG4126HT4396 Zinc Finger Pix)tein Hzf4 4.3 

65 CH22_FGENES.360_3 4.3 

CH22^FGENES.706_9 4.3 

CH.21.hsgi|6531965 4.2 



HG2614Hr2710 
X83535 

HG4716HT5158 
HG4677HT5102 



M714311 



HG2465HT4871 



AA707750 



AA976074 



CH.17_hsgl|5867215 
CH22_FGENES.669J 

CH22_FGENES.48_18 

matrix metalloproteinase 14 (nnembrane inserted) 
CH22^FGENES,271J 
CH22_FGENES,617_3 
CH22_FGENES.290_8 

CH22_F6ENES.135 
CH22_FGENES.13 2 
CH.14_hsgi|6682474 
CH.02_hsgl|5867750 
CH22.F6ENES,617_8 

CH22_DJ32I10.GEN8CAN.23 39 

CH22_F6ENES.543_20 

CH22_EM:AC005500.GENSCAN56 1 

CH22_FGENES.204_2 

CH22 FGENES.619_4 

CH.16_hsgi|5867087 

EST cluster (not in UniGene) 

CH22_EM:AC005500.GENSCAN.149 9 

CH22_EM:AC005500.GENSCAN.421 5 

CH22_FGENES.134 

CH.07_hsgi|6004478 

CH22_FGENES.3eO_1 

CH2a_FGENES.6j2 

CH22_C20H12.GENSCAN.16 2 

CH22_C65E1.GEN$CAN.8 1 

ESTs; Weakly similar to cIs Golgl matrix 

CH22_FGENES.307_4 

CH22_EM:AC005500.GENSCAN.248 14 

CH.06Jisgi|5902482 

CH2^FGENES.669_5 

CH22_DJ32I10.GENSCAN.19 8 

CH22_FGENES.527_6 

CH22_FGENES.330_10 

CH22_FGENES.14 2 

ESTs 

CH22^FGENES.226 7 
CH2a,FGENES.133 
CH22.EM:ACOO55O0.GENSCAN.209 12 
CH22_FGENES.271.3 



4.1 
4.1 

Collagen, Type Vlil, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5' MonophosphatB Synthase 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene Ret/Ptc2. Fusion Activated 

3.7 

3.7 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

Dna Binding Protein Ap 2. Alt Splice 3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 



3.8 



37 



3.3 






TABLE 17A 

Table 17A shows the accession numbers for those pkeys lacking UniGene IDs for Table 17.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey CAT number Accession

116845 1 AA649530 AA659316 H64973

103207 30635_-4 X72790

126257 182217_1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815

102791 37186_1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833
AI533818 N29986 U87592 U87593 U87590 U87591 846404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574
N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833
AW003247 AW495808 AI080480 AI631703 AI651023 AI867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624d17 BE466611 AI206344 AA574397
AA348354 AI493192

126872 142696_1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363

112631 1746257_1 R82040 R70934

120742 176835_1 AA225084 AA302713

106864 324239_1 AI311928 AA936030 T51931 AA609816 AA487195 AA664207

109700 genbank_F09609 F09609

111532 genbank_R08440 R08440

113938 genbank_W81598 W81598

113947 genbank_W84768 W84768

124357 genbank_N22401 N22401

108733 504187_1 AA121022 AA126422

112303 genbank_R54797 R54797

322136 46802_1 AF075083 H52291 H52528

322296 47334_1 W76326 AF086341 W72300

321811 1527481_1 D80630 D80896 D80895

314648 293660_1 AW979268 AA878419 AA431342 AA431628

322520 38916_1 T55958 T57205 AF147346

322521 38917_1 AF147347 T55426 T55503

322675 86787_1 AA017656 AA017374 AA019761

323332 179142_1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315

316186 425440_1 AI433540 AA728984 AA804981

322976 1510563_1 C16391 C16413

324261 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080

323817 233566_1 AA410943 AW948953 AA334202 AA332882

301976 128835_1 T97905 AA101672

324961 376239_1 AA613792 AW182329 T05304 AW858385

303642 284260_1 AW299459 AA417112

303797 386364_1 AW629759 AW749955 AA633408 AI651005

319551 357371_1 AA761668 AA573621 R92814 R09670

311935 174129_1 AA216387 T63548 AA228676

319834 112523_1 AA071267 T65940 T64515 AA071334

319977 345248_1 AA534222 AA632632 T81234

314138 179960_1 AA740616 AA654854 AA229923

313591 103087_1 AA046309 AI263500 AA046397

308106 AI476803

338255 CH22_6856FG__LINK_EM:AC00

335809 CH22_3181FG_617_6_LINK_EM

335824 CH22_3197FG_619_11_LINK_E

307010 AI140014 

307041 AI144243 

305913 AA876109 

305917 AA876469 

309574 AW168083 

325372 Cl2_hs 

325544 c12_hs 

332798 CH22_14FG_6^5_LINK_C4G1.G 

334223 CH22.1507FG^360J_LINK.EM 

327075 c21_hs 

334447 CH22_1746FG_387_7_LINK_EM

304782 AA582081 

313434 441798_1 W92070 AW019952 W92053

TABLE 17B

Table 17B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 17. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.

Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

334447 Dunham, I. et al. Plus 14308764-14308824

335809 Dunham, I. et al. Plus 26310772-26310909

335824 Dunham, I. et al. Plus 26376860-26376942

332798 Dunham, I. et al. Minus 232147-231974

334223 Dunham, I. et al. Minus 12734365-12734269

338255 Dunham, I. et al. Minus 15242294-15242231

325372 5866920 Minus 1117061-1117304

325544 6682452 Plus 171228-171286

327075 6531955 Plus 4041318-4041431 
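
For readers who want to work with the Table 17B rows programmatically, the following is a minimal Python sketch, not part of the original disclosure. It assumes each row has the layout "Pkey Ref Strand first-second" and that the two coordinates are inclusive; the names `PredictedExon`, `ROW`, and `parse_row` are hypothetical. Because the Minus-strand rows are printed in both descending and ascending order, the length is taken from the absolute difference.

```python
import re
from typing import NamedTuple


class PredictedExon(NamedTuple):
    pkey: int    # Eos probeset identifier
    ref: str     # sequence source: a GI number or the Dunham et al. citation
    strand: str  # "Plus" or "Minus"
    first: int   # first coordinate as printed in the table
    second: int  # second coordinate as printed in the table

    @property
    def length(self) -> int:
        # Assumption: coordinates are inclusive, so a 10-20 span covers 11 nt.
        return abs(self.second - self.first) + 1


# Pkey, free-text reference, strand keyword, then "first-second".
ROW = re.compile(r"^(\d+)\s+(.+?)\s+(Plus|Minus)\s+(\d+)-(\d+)$")


def parse_row(line: str) -> PredictedExon:
    """Parse one Table 17B row into a PredictedExon record."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    pkey, ref, strand, first, second = match.groups()
    return PredictedExon(int(pkey), ref, strand, int(first), int(second))
```

Under these assumptions, the row for pkey 325544 (171228-171286) would describe a 59-nucleotide predicted exon.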

TABLE 18: Table 2 from BRCA 014 P

Table 18 shows genes with at least five times the expression in breast tumor tissue as
in normal body tissues.



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 
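
The column layout defined above, together with the five-fold cutoff stated for Table 18, can be sketched in Python as follows. This is an illustrative reading aid rather than the applicants' method: it assumes whitespace-separated rows whose last field is R1, treats the UniGene ID as optional (some rows carry none), and the names `Row`, `parse_row`, and `overexpressed` are hypothetical.

```python
from typing import Iterable, List, NamedTuple, Optional


class Row(NamedTuple):
    pkey: int                  # Eos probeset identifier
    ex_accn: str               # exemplar Genbank accession
    unigene_id: Optional[str]  # e.g. "Hs.180884"; None when the row has none
    title: str                 # UniGene title (often truncated in the table)
    r1: float                  # ratio of tumor to normal body tissue


def parse_row(line: str) -> Row:
    """Split one table row; the last token is R1, the first two are IDs."""
    tokens = line.split()
    pkey, ex_accn, r1 = int(tokens[0]), tokens[1], float(tokens[-1])
    rest = tokens[2:-1]
    # Assumption: a UniGene ID, when present, always starts with "Hs.".
    unigene_id = rest.pop(0) if rest and rest[0].startswith("Hs.") else None
    return Row(pkey, ex_accn, unigene_id, " ".join(rest), r1)


def overexpressed(rows: Iterable[Row], threshold: float = 5.0) -> List[Row]:
    """Keep rows whose tumor/normal ratio meets the stated cutoff."""
    return [row for row in rows if row.r1 >= threshold]
```

Applied to the rows of the broader expression table, a filter of this kind would retain only entries whose R1 is 5 or greater, which is the criterion Table 18 describes.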



Pkey ExAccn UnigeneID Unigene Title R1

101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3

101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9

101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 12

101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5

103010 X52509 Hs.161640 tyrosine aminotransferase 12.4

104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7.6

104825 AA035613 Hs.141883 ESTs 6.9

107105 AW963419 Hs.155223 stanniocalcin 2 5.3

108819 AA011449 Hs.271627 ESTs 6.1

112287 AB033064 Hs.334806 KIAA1238 protein 7.3

112561 AI791493 Hs.129873 ESTs, Weakly similar to A36036 cytochrom 8.2

112637 R82331 Hs.164599 ESTs 5.4

113206 BE262470 Hs.241471 RNB6 6.2

113970 W27249 Hs.8109 hypothetical protein FLJ21080 6.9

114965 AI733881 Hs.72472 BMP-R1B 10.1

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 19.7

119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 8.4

121611 M31669 Hs.1735 inhibin, beta B (activin AB beta polypep 5.6

129301 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 6.2

133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 6.2

134731 D89377 Hs.89404 msh (Drosophila) homeo box homolog 2 5.8

300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9

301664 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7

302001 AB020711 Hs.278346 KIAA0904 protein 7.7

302067 BE542706 Hs.222399 CEGP1 protein 7.3

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.4

302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1

302372 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 6.7

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.8

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6

310781 AI380797 Hs.158992 ESTs 10.2

311166 AI821005 Hs.118599 ESTs 10.8

311935 AA216387 gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2

312153 BE261944 Hs.118625 hexokinase 1 5.2

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4

313915 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 26.3

314097 AA648744 Hs.269493 ESTs 6.6

314138 AA740616 gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9

314506 AA833655 Hs.206868 Homo sapiens cDNA FLJ14056 fis, clone HE 8.5

314558 AI873274 Hs.190721 ESTs 27.4

314691 AW207206 Hs.136319 ESTs 20.7

315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9

315021 AA533447 Hs.312989 ESTs 5.3

315051 AW292425 Hs.163484 ESTs 12.9

315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUC_HUMAN I 5.8

315196 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.2

315530 AW015415 Hs.127780 ESTs 8.9

315634 AA837085 Hs.220585 ESTs 6.3

316012 AA764950 Hs.119898 ESTs 7

316177 AI904982 Hs.293102 ESTs, Moderately similar to ALU1_HUMAN A 30.7

316580 AA938198 Hs.146123 poly(A) polymerase gamma 9.4

317803 AW664964 Hs.126899 ESTs 6.1

317881 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 9.6

318740 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 7.3

318744 AI793124 Hs.144479 ESTs 17.8

320211 AL039402 Hs.125783 DEME-6 protein 9.2

321107 AI732643 Hs.144151 ESTs 12.3

321644 AW975944 Hs.237396 ESTs 11.7

321978 N77342 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT 5

322035 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 19

322766 AW068805 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.2

322818 AW043782 Hs.293616 ESTs 7.6

322975 C16391 gb:C16391 Clontech human aorta polyA mRN 16.5

323262 AL133990 Hs.190642 ESTs 10.5

323332 AI829520 gb:wl19c06.x1 NCI_CGAP_Ut1 Homo sapiens 6.2

323817 AA410943 BMP-R1B 8.4

324261 BE069341 gb:QV3-BT0381-270100-073-c08 BT0381 Homo 49.4

324432 AA464510 Hs.152812 ESTs 16.5

324598 AW972227 Hs.163986 Homo sapiens cDNA: FLJ22765 fis, clone K 5

324603 AW993522 Hs.292934 ESTs 10.4

324987 AI375572 Hs.172634 ESTs 18.8

325544 Phase2&3Exons 5.7

330388 Hs.46 HER2 receptor tyrosine kinase (c-erb-b2, 6.6

334223 NM_005080*:Homo sapiens X-box binding pr 26.2

335809 NM_014509*:Homo sapiens kraken-like (BK1 10.1

335824 ENSP00000249072*:DJ222E13.1 (N-TERMINAL 20

AI052047 ESTs; Weakly similar to CYTOCHROME P450 6.7

R72427

CH22_EM:AC005500.GENSCAN.127 9 55 

CH22_FGENES.619_13 92 

CH22_FGENES.617_9 8 

CH22_FGENES.271_7 6.5 

CH22_FGENES.619_7 7,3 

CH22_FGENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

CH22_EM:AC005500.GENSCAN,304 2 11.3 

CH.07.hsgl|6004473 8.5 

CH22„FGENES.617_7 7.1 

CH22_F6ENES.678_5 7.2 

CH22^GENES.678.5 16.8 
TABLE 18A 

Table 18A shows the accession numbers for those Pkeys lacking Unigene IDs for Table 18.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

323332 179142_1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315
322975 1510563_1 C16391 C16413
324261 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
323817 233566_1 AA410943 AW948953 AA334202 AA332882
311935 174129_1 AA216387 T63548 AA228676
314138 179960_1 AA740616 AA654854 AA229923
335809 CH22_3181FGJ17_6.UNK_EM
335824 CH2^.3197FG„619JUINK_E
325544 cl2 hs
334223 CH22.1507FG_360AUNK.EM
TABLE 18B

Table 18B shows the genomic positioning for those Pkeys lacking Unigene IDs and
accession numbers in Table 18. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

335809 Dunham, I. et al. Plus 26310772-26310909
335824 Dunham, I. et al. Plus 26376860-26376942
334223 Dunham, I. et al. Minus 12734365-12734269
325544 6682452 Plus 171228-171286
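
As an illustration only, the rows of Table 18B can be read programmatically. The sketch below is not part of the described method. The function name `parse_exon` is hypothetical, and it assumes that the two coordinates are inclusive and that a Minus-strand row lists the higher coordinate first, as the row for Pkey 334223 appears to do.

```python
def parse_exon(strand, nt_position):
    """Split an Nt_position string such as '26310772-26310909' into coordinates.

    Assumption: both endpoints are inclusive, so length = |end - start| + 1.
    """
    if strand not in ("Plus", "Minus"):
        raise ValueError("strand must be 'Plus' or 'Minus'")
    first, second = (int(part) for part in nt_position.split("-"))
    return {
        "strand": strand,
        "low": min(first, second),    # smaller genomic coordinate
        "high": max(first, second),   # larger genomic coordinate
        "length": abs(second - first) + 1,
    }


# Two rows copied from Table 18B.
plus_exon = parse_exon("Plus", "26310772-26310909")
minus_exon = parse_exon("Minus", "12734365-12734269")
```

Normalizing to `low` and `high` lets Plus and Minus rows be compared on the same axis, while the `strand` field keeps the orientation.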
TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT TISSUES 

Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues.
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or
equal to 2.5. The "average" breast cancer level was set to the 90th percentile value. The
"average" normal adult tissue level was set to the 90th percentile value amongst 144
non-malignant tissues. In order to remove gene-specific background levels of non-specific
hybridization, the 15th percentile value amongst the 144 non-malignant tissues was subtracted
from both the numerator and the denominator before the ratio was evaluated.
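
The selection rule described above can be sketched as follows. This is a minimal illustration, not the software actually used. The function names are hypothetical, the linear-interpolation percentile is an assumed convention (the text does not say how percentiles were computed), and raising an error for a non-positive denominator is likewise an assumption.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between order statistics (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(tumor_levels, normal_levels):
    """Background-subtracted ratio of 'average' tumor to 'average' normal expression."""
    # Gene-specific background: 15th percentile amongst the non-malignant tissues.
    background = percentile(normal_levels, 15)
    numerator = percentile(tumor_levels, 90) - background
    denominator = percentile(normal_levels, 90) - background
    if denominator <= 0:
        # Assumption: the text does not say how such probesets were handled.
        raise ValueError("normal 90th percentile does not exceed background")
    return numerator / denominator


def is_upregulated(tumor_levels, normal_levels, cutoff=2.5):
    """Apply the ratio cutoff stated for Table 19 (greater than or equal to 2.5)."""
    return tumor_to_normal_ratio(tumor_levels, normal_levels) >= cutoff
```

Because the same background value is subtracted from numerator and denominator, a probeset whose tumor and normal distributions coincide gives a ratio of 1 regardless of its background level.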



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal body tissue

Pkey ExAccn UnigeneID UnigeneTitle R1

408591 AF015224 Hs.46452 mammaglobin 1 137£

406964 M21305 gb:Human alpha satellite and satellite 3 71.0

400291 AA401369 Hs.190721 ESTs 68.4

407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 54.2

449746 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 46.4

426878 BE069341 gb:QV3-BT0381-270100-073-c08 BT0381 Homo 44.8

400292 AA250737 Hs.72472 BMP-R1B 37.4

427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 32.9

408045 AW138959 Hs.245123 ESTs 31.9

407178 AA195651 Hs.104106 ESTs 30.4

407377 C16391 gb:C16391 Clontech human aorta polyA mRN 27.7

450705 U90304 Hs.25351 Iroquois homeobox protein 5 24.8

407212 AA412108 Hs.269350 ESTs 22.0

428848 NM_000230 Hs.194236 leptin (murine obesity homolog) 21.9

404561 trichorhinophalangeal syndrome I (TRPS1) 21.8

407980 AA046309 gb:zf12f01.s1 Soares_fetal_heart_NbHH19W 19.8

447350 AI375572 Hs.172634 ESTs 17.3

450375 AA009647 Hs.8850 a disintegrin and metalloproteinase doma 16.6

422109 S73265 Hs.1473 gastrin-releasing peptide 16.5

435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 16.0

453160 AI263307 Hs.239884 H2B histone family, member L 15.8

420813 X51501 Hs.99949 prolactin-induced protein 15.8

415989 AI267700 Hs.3175M ESTs 15.5

422505 AL120862 Hs.124155 programmed cell death 9 (PDCD9) 14.8

424399 AI905687 Hs.2533 aldehyde dehydrogenase 9 family, member 14.5

423575 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 13.7

429441 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.6

431474 AL133990 Hs.190642 ESTs 13.5

448595 AB014644 Hs.21572 KIAA0644 gene product 13.0

427217 AA399272 Hs.144341 ESTs 12.8

402578 C1001134:gi|2117372|pir||I65981 fatty ac 12.6

422805 AA436989 Hs.121017 H2A histone family, member A 12.2

424634 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 12.0

456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 11.9

424086 AI351010 Hs.102267 lysyl oxidase 11.9

459587 AA031956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 11.5
416208 AW291168 Hs.41295 ESTs, Weakly similar to MUC2_HUMAN MUCIN 11.5

429170 NM_001394 Hs.2359 dual specificity phosphatase 4 11.5

407276 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 11.4

434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ11382 fis, clone HE 11.3

448390 AL035414 Hs.21068 hypothetical protein 11.1

452401 NM_007115 Hs.29352 tumor necrosis factor, alpha-induced pro 11.0

421037 AI684808 Hs.197653 programmed cell death 9 (PDCD9) 10.9

452461 N78223 Hs.108106 transcription factor 10.7

443348 AW873596 Hs.182276 calmodulin 2 (phosphorylase kinase, delt 10.6

421155 H87879 Hs.102267 lysyl oxidase 10.5

402606 NM_024626:Homo sapiens hypothetical prot 10.4

409269 AA576953 Hs.22972 hypothetical protein FLJ13352 10.4

447268 AI370413 Hs.36563 hypothetical protein FLJ22418 10.3

447033 AI357412 Hs.157601 ESTs 10.2

400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1

424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1

432441 AW292425 Hs.163484 ESTs 9.9

427365 AI873274 Hs.190721 ESTs 9.9

438950 H23789 Hs.144530 EST 9.8

422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 9.7

425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 9.7

411869 W20027 Hs.23439 ESTs 9.6

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6

445730 AI624342 Hs.170042 ESTs 9.5

459583 AI907673 gb:IL-BT152-080399-004 BT152 Homo sapien 9.3

452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 9.2

432596 AJ224741 Hs.278461 matrilin 3 9.1

400297 AI127076 Hs.334473 hypothetical protein DKFZp564O1278 9.1

449448 D60730 Hs^7471 ESTs 9.1

423945 AA410943 gb:zt32h03.r1 Soares ovary tumor NbHOT H 9.1

406348 Target Exon 9.0

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 9.0

453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 9.0

433365 AF026944 Hs.293797 ESTs 8.8

405654 NA C12001521:gi|7513934|pir||T31081 cca3 pr 8.8

418601 AA279490 Hs.86368 calmegin 8.8

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 8.7

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.5

409041 AB033025 Hs.50081 KIAA1199 protein 8.4

442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 8.3

451561 N52812 Hs.177403 ESTs 8.2

424001 W67883 Hs.137476 paternally expressed 10 8.2

429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.1

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.1

405095 NA Target Exon 8.1

419296 AA236115 Hs.120785 ESTs 8.0

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 8.0

413472 BE242870 Hs.75379 solute carrier family 1 (glial high affi 8.0

416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.0

415385 R17798 Hs.7535 COBW-like protein 7.9

434424 AI811202 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 7.9

420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 7.9

406687 M31126 Hs.272620 matrix metalloproteinase 11 (MMP11; stro 7.8

400285 NA Eos Control 7.7

437207 T27503 Hs.15929 hypothetical protein FLJ12910 7.6

427119 AW880562 Hs.114574 ESTs 7.5

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 7.5

433426 H69125 Hs.133525 ESTs 7.5

411078 AI222020 Hs.182364 CocoaCrisp 7.4

426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 7.4

447475 AI380797 Hs.158992 ESTs 7.3

415263 AA948033 Hs.130853 ESTs 7.2

439569 AW602166 Hs.222399 CEGP1 protein 7.2

414142 AW368397 Hs.150042 Homo sapiens cDNA FLJ14438 fis, clone HE 7.1

426261 AW242243 Hs.168670 peroxisomal farnesylated protein 7.0

444763 AK001468 Hs.62160 anillin (Drosophila Scraps homolog), act 6.9

445885 AI734009 Hs.127699 KIAA1603 protein 6.9
429432 AI678059 Hs.202676 synaptonemal complex protein 2 6.9

410781 AI375672 Hs.165028 ESTs 6.9

443788 AI732643 Hs.144151 ESTs 6.9

421373 AA808229 Hs.167771 ESTs 6.8

451398 AI793124 Hs.144479 ESTs 6.8

404253 NM_021058*:Homo sapiens H2B histone fami 6.8

441098 AI015591 Hs.131004 ESTs, Weakly similar to T17227 hypotheti 6.7

426215 AW963419 Hs.155223 stanniocalcin 2 6.6

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 6.6

422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.6

411111 AW818127 gb:CM1-ST0277^1299-059-b07 ST0277 Homo 6.6

434988 AI418055 Hs.161160 ESTs 6.6

442580 AI733682 Hs.130239 ESTs 6.6

449611 AI970394 Hs.197075 ESTs 6.6

408000 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 6.5

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 6.5

431089 BE041395 Hs.283676 ESTs, Weakly similar to unknown protein 6.5

400301 X03635 Hs.1657 estrogen receptor 1 6.5

427356 AW023482 Hs.97849 ESTs 6.5

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.4

441134 W29092 Hs.7578 cellular retinoic acid-binding protein 1 6.4

424902 NM_003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4

448693 AW004854 Hs.228320 hypothetical protein FLJ23537 6.4

431448 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 6.2

444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 6.1

422168 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 6.1

453331 AI240665 Hs.8895 ESTs 6.1

418007 M13509 Hs.63169 matrix metalloproteinase 1 (MMP1; inters 6.0

441233 AA972965 Hs.135568 ESTs 6.0

418092 R45154 Hs.106604 ESTs 6.0

430044 AA464510 Hs.152812 ESTs 5.9

432837 AA310693 Hs.87329 HSPC072 protein 5.9

433285 AW975944 Hs.237396 ESTs 5.9

450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.9

425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 5.9

410785 AW803341 gb:IL2-lM)079^90X04)50-D03 UM0079 Homo 5.9

425398 AL049689 Hs.1563e9 hypothetical protein similar to tenascin 5.9

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8

459371 R20991 gb:yg06h01.r1 Soares infant brain 1NIB H 5.8

411284 N28519 Hs.135191 ESTs, Weakly similar to unnamed protein 5.8

453511 AL031224 Hs.33102 transcription factor AP-2 beta (activati 5.8

451307 W52854 Hs.27099 hypothetical protein FLJ23293 similar to 5.7

430510 AW162916 Hs.241576 hypothetical protein PRO2577 5.7

415539 AI733881 Hs.72472 BMP-R1B 5.6

438199 AW016531 Hs.122147 ESTs 5.6

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 5.5

430019 AA463893 Hs.220933 ESTs 5.5

439809 R41396 Hs.101774 hypothetical protein FLJ23045 5.5

423811 AW299598 Hs.50895 homeo box C4 5.4

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2_HUMAN MUCIN 5.4

439138 AI742605 Hs.193696 ESTs 5.4

453931 AL121278 Hs.25144 ESTs 5.4

444078 BE246919 Hs.10290 U5 snRNP-specific 40 kDa protein (hPrp8- 5.4

447102 BE167434 Hs.98471 ESTs, Weakly similar to T18712 hypotheti 5.4

451821 AI879148 Hs.26770 fatty acid binding protein 7, brain 5.4

425236 AW067800 Hs.155223 stanniocalcin 2 5.3

421464 AA291553 Hs.190086 ESTs 5.3

450736 AW970060 gb:EST382140 MAGE resequences, MAGK Homo 5.3

428085 AA421081 Hs.12388 ESTs 5.3

452838 U65011 Hs.30743 preferentially expressed antigen in mela 5.3

445424 AB028945 Hs.12696 cortactin SH3 domain-binding protein 5.3

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2

438167 R28363 Hs.24286 ESTs 5.2

433330 AW207084 Hs.132816 hypothetical protein MGC14801 5.2

449765 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 5.2

416276 U41060 Hs.79136 LIV-1 protein, estrogen regulated 5.2
400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 5.2

418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNFfeirrniy receptor alpha 1 5.2 

428771 AB028992 Hs.193143 KIAA1069 protein 5.2 

5 455047 AW852530 gb:PM1.CT0243-07l099^1-g06CT0243Homo5.2 

419169 AW851980 Hs.262346 ESTs. Weakly similar to S72482 hypothetl 5.2 

453197 AI916269 Hs.109057 ESTs, Weafdy similar to ALUS^HUMAN ALU S 5.1 

400298 AA032279 Hs.61635 six transmembrane epitheliaJ antigen of 5.1 

431023 AI283133 Hs.297420 ESTs 5.1 

10 427665 AI791495 Hs.180142 caimodulin-like skin protein 5.1 

427718 Af798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs.116369 ESTs 5.1 

429220 AW207206 Hs.1 36319 ESTs 5.1 

405494 NA C2001837*:gi|12697903|dbi|BAB2177ai| (A 5.1 

15 452930 AW195285 Hs.194097 ESTs. Weakly similar to 138022 hypothetl 5.1 

444910 AI201849 gb:qs76g04.x1 NCLCGAP.Pr28 Homo sapiens 5.1 

453310 X70697 Hs.553 solute canier family 6 (neurotransmltte 5.0 

444381 BE387335 Ms.283713 ESTs. Weakly similar to S64054hypotheti 5.0 

450603 R43646 Hs. 12422 ESTs 5.0 

20 416575 W02414 Hs.38383 ESTs 5.0 

438504 AW665281 Hs.224625 ESTs 5.0 

416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient, yeast h 5.0 

428804 AK000713 Hs.193736 hypothetical protein HJ20706 5.0 

420077 AW512260 Hs.87767 ESTs 4.9 

25 450480 X82125 Hs.25040 zinc finger protein 239 4.9 

437637 AJ003029 Hs.65792 syntropWn, gamma 2 4.9 

431808 M30703 Hs.270833 amphireguJin{schwannoma-<Jerived growth 4.9 

418836 AI555499 Hs.161712 ESTs 4,8 

442441 A)820662 Hs.129598 ESTs 4.8 

30 435635 AF220050 Hs.181385 uncharactetized hematopoietic stem/proge 4.8 

400286 NA C16000922;9i|7499103|plrHT20903hypothe 4.8 

407506 U71600 gb:Human zinc finger protein zfp31 (zf31 4.8 

420026 AI831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

35 457726 AI217477 Hs.l94591 ESTs 4.8 

412785 AW997556 Hs.78521 KIAA171 7 protein 4.8 

428368 BE440042 Hs.83326 matrix metalopnoteinase 3 (stromelysln 4.7 

436026 AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 niban protein 4.7 

40 400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cDNAFLJ14035fis» clone HE 4.7 

407819 R42185 Hs.274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channel, calcium activated. £am 4.7 

422896 AW961489 Hs.154116 ESTs 4.7 

45 453616 Nf\L003462 Hs.33846 dyneln. axonemal, Hght intermediate pel 4.7 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 Hs.159153 ESTs, IVIoderalely similar to S65657 alpha 4.6 

454074 R63603 Hs.28419 ESTs 4.6 

405718 C4000799*:gll6330365|dbilBAA86508.1| (AB 4.6 

50 444649 AW207523 Hs.197628 ESTs 4.6 

429431 240313 Hs.106330 Homo sapiens clone IMAGE;23371, mRNAseq 4.6 

427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 4.6 

447342 AI199268 Hs.19322 Homo sapiens. Similar to RIKENcDNA 2010 4.6 

430345 AK000282 Hs.239681 hypothetical protein FLJ20275 4.6 

55 454307 AW855717 gb:RC1-CT0279-081299-013-b01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 LI V-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 

451340 AW936273 gb:QV0^002(W)9020a-107-g07DT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4.6 

60 457430 AA514660 Hs.128443 ESTs 4.6 

416030 H16261. Hs.21948 ESTs 4,6 

447233 AW246333 Hs,17901 Homo sapiens, clone IMAGE:3937015,mRNA, 4.6 

445537 AJ245671 Hs.12844 EGF-lii<e-domaln, multiple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FU20086 4.5 

65 432374 W68815 Hs.301 885 Homo sapiens cDNAFU 11 346 fis, done PL 4.5 

423833 AW503329 gb:Ul-HF.BNO^kx^-02-0-Ul.r1 NIH_MGCJ0 4,5 

406747 A1925153 Hs.217493 annexinA2 4.5 
412102 H56435 gb:yq98e09.r1 Soares fetal liver spleen 4.5

431716 D89053 H5.268012 fatty-acld-Coenzyme A l^ase, long-chain 4.5 

411050 AW614902 gb:MR1-ST0206-120400-a22-f08ST0206Homo 4.5 

401418 NA C14000338*:glI7459502|pir||S74665 outer 4.5 

5 436194 AK001074 Hs.333435 Homo sapiens cDNAFU10212fis, clone HE 4.5 

436211 AK001581 Hs.33482B hypothetical protein FU1 0719; KIAA1 794 4.4 

414080 AA135257 Hs.47783 B aggressive lymphoma gene 4.4 

424115 AA335497 Hs.293965 ESTs, Weakly sfenllar to 138022 hypotheti 4.4 

415786 AW419196 Hs.257924 hypothetical protein FU13782 4.4 

10 442117 AW664964 Hs.128899 ESTs 4.4 

452784 BE463857 Hs.151258 hypothetical protein FU21062 4.4 

432731 R31 178 Hs.2B7820 fibronectin 1 4.4 
410534 AW905138 gb:QVO-NN1071-280400-207-g07NN1071 Homo4.4 

405195 NA C2000662*:gi|7512792|pir||T12482hypothe 4.4 

IS 430217 N47863 Hs.336901 ribosomal protein S24 4.4 

401793 C17001545:gi|5360127|gb|AAD4288Z1|AF155 4.4 

415747 AA381209 gb:ESTg4257 Activated T-cells I Homo sap 4.4 

423679 AB007975 Hs.131454 KiAA0506 protein 4.4 

400238 NA C19000274*:gi|12741327|rBflXP_008833.2| 4.4 

20 425627 AF019612 Hs.297007 membrane-bound transcription factor prot 4.4 

400608 C100018999i|7508633|pir|[T25392hypothe 4.4 

458534 AV657310 Hs.282898 ESTs 4.3 

407771 AL138272 Hs.62713 ESTs 4.3 

405906 NA Target Exon 4.3 

25 405925 NA Target Exon 4.3 

439382 BE2476M Hs.1 03070 ESTs 4.3 

445263 H57646 Hs.42586 KIAA1 560 protein 4.3 

407162 N63855 Hs.142634 zinc finger protein 4.3 

419536 AA603305 gb:np12d11.s1 NCLC6AP_Pr3 Homo sapiens 4.3 

30 454359 N71277 gb:za36e03.s1 Soares fetal liver spleen 4.3 

411558 AA102670 Hs.70725 gamma>amlnobutyr1cadd(GABA)Arecepto 4.2 

450715 At266484 Hs.31570 ESTs, Weakly similar to KtAA1324 protein 4.2 

421451 AA291377 Hs.50831 ESTs 4.2 

452864 AA033714 Hs.287629 hypothetical protein FU14260 4.2 

35 409757 NM001898 Hs,123114 cystatinSN 4.2 

413043 BE158766 gb:IL2-HT0397*O71299-O24-F02HT0397 Homo 4.2 
413499 BE144884 gb:CMO-HT0182-04109W)65-e11 HT0182Homo4.2 

444619 BE538082 Hs.8172 ESTs, Moderately similar to A46010X-lln 4.2 

408380 AF12305O Hs.44532 diublquitin 4.2 

40 406992 S82472 gb:beta-poI=DNA polymerase beta {exon a 4.2 

404285 NA C6001909;gip04441|dbj|BAA1B909.1|(D298 4.2 

425247 NM_005940 Hs.155324 matrix meyioproleinase 11 (MMP11;stro 4,2 

428046 AW812795 Hs.155381 ESTs. Moderately similar to 138022 hypol 4.2 

446163 AA0266B0 Hs.25252 prolactin receptor 4.2 

45 421147 AW592167 Hs.293299 ESTs 4.2 

426451 AI908165 Hs.16g946 GATA-bindlng protein 3 (T-ceil receptor 4.2 

415227 AW821113 Hs.72402 ESTs 4.2 

452176 AA024538 Hs,282990 Human DNAsequerice from clone RP1-28H20 4.2 

452862 AW378065 Hs.8687 ESTs 4.2 

50 443646 AI085198 Hs.164226 ESTs 4.2 

425523 AB007948 Hs.1 58244 KIAA0479 protein 4.1 

424687 J05070 Hs.151738 matrix metaltoproteinase 9 (gelatinase B 4.1 

430009 AA894554 Hs.22242 ESTs 4.1 

434469 AA634806 gb:ab28c02.r1 Stratagene lung (937210) H 4.1 

55 451381 BE241831 Hs.172330 hypothetical protein MGC2705 4.1 

450229 R18717 Hs.8929 hypothetical protein FU 11 362 4.1 
455700 BE068115 gb:CM1-BT0368-061299-060-g07BT0368 Homo 4.1 

431924 AK000850 Hs.272203 Homo sapiens cDNA FU20843 fis. ctone AD 4.1 

438885 AI886558 Hs.184987 ESTs 4.1 

60 401451 NM_004496*:Homo sapiens hepatocyte nude 4.1 

431676 AI885464 gb:tl88f04j(1 NCI CGAP_Pr28 Homo sapiens 4.1 

409092 AI735283 Hs.172608 ESTs 4.1 

429270 We0379 Hs.57773 ESTs 4.1 

443903 AI220547 Hs.135223 ESTs 4.1 

65 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (o^rb-b2. 4.1 

410275 U85658 Hs.61796 transcription factor AP-2 gamma (adivat 4.1 

432912 BB)07371 Hs.200313 ESTs 4.1 
403585 Target Exon 4.1

438295 AI394151 Hs.37932 ESTs 4.1 

420380 AA840891 Hs.102406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase VIII 4.1 

5 416182 NM_004354 Hs.79069 cyclln62 4.1 

418994 AA296520 Hs.89546 selectin E (endothelial adhesion molecul 4.1 

400555 Ta/getExon 4.1 

410079 U943B2 Hs.58589 glycogenin2 4.0 

427674 NM_003528 Hs.2178 H2Bhistone family, member Q 4.0 

10 427131 AA448460 Hs.112017 GE36gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length Insert cDN 4.0 

429353 AL1 17406 Hs.200102 ATP-l)indlng cassette transporter MRP8 4.0 

421296 NM.002666 H5.103253 perinpin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

IS 424188 AW954552 H5.142634 zinc finger protein 4.0 
455431 AW938484 gbX:M0-DT0O57.2902O0-253-d06 DT0057 Homo4.0 

404142 m Target Exon 4.0 

441143 AI027604 Hs.159650 ESTs 4.0 

444540 A1693927 Hs.265165 ESTs 4.0 

20 415579 AA165232 HS.222D69 ESTs , 4.0 

452891 N75582 Hs.21 2875 ESTs, Wealdy similar to DYH9_HUMAN CILIA 4.0 
414605 BE390440 gb:601 283601 F1 NIH.MGC 44 Homo sapiens c 4.0 

452281 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 fis. done PL 4.0 

417801 AA417383 Hs.82582 integrin. beta-like 1 (with EGF-like rep 4.0 

25 446232 AI281848 Hs.194691 reUnolc add Induced 3 4.0 

447377 X77343 Hs.334334 transcription factor Ap.2 alpha (activat 4.0 

437854 AL119723 gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

446140 AA356170 H5.26750 hypothetical protein FU21 908 4.0 

452240 AI591147 Hs.61232 ESTs 4.0 

30 459574 AI741122 Hs.101810 Homo sapiens cONA FU1 4232 fis, done NT 4.0 

458673 N9g626 gb:za39d11.r1 Scares fetal liver spleen 4.0 

444858 AI199738 Hs.208275 ESTs, Weakly similar to ALUA^HUMAN lill 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF1 53330 Hs.30246 solute earner family 19 (thiamine trans 3.9 

35 450192 AA263143 Hs.24596 RAD51 -interacting protein 3.9 

406554 NA Target Exon 3.9 

416259 AA573006 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanine^lyoxylate aminotransferase 2-11 3.9 

451024 AA442176 gb:zw63b08.r1 SoarBs_totaLfetus_Nb2HF8_ 3.9 

40 413930 M86153 Hs.75618 RAB1 1 A. member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs.328142 ESTs 3.9 

452564 AA026777 gb:2e93c11.r1 SoaresL.fetaLheartNbHH19W 3.9 

442500 AI819068 Hs^09122 ESTs 3.9 

45 419759 Z21336 Hs.135411 adln relatad protein 3.9 

424638 AI472106 Hs.49303 Honm sapiens cDNAFUl 1663 fis, clone HE 3.9 

439699 AF086534 Hs.187561 ESTs, Moderately similar to ALU1J^UMAN A 3.9 

428042 AA419529 Hs.76391 myxovlrus (influenza) resistance 1, homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FL J 10980 3.9 

50 453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sap/ens cDNA: FU22097 fis, clone H 3.9 

443489 AI073512 Hs.1339l6 ESTs 3.9 
455092 BE152428 gb:CM0-HTO323-1 51 299-1 26-b04 HT0323 Homo3.9 

401785 NtA.002275*:Homo sapiens keratin 15 (KRT1 3S 

55 426427 M86699 Hs. 1 69840 TTK protein kinase 3.9 

446009 AI989885 Hs.231926 ESTs 3.9 

436033 H75391 Hs.255748 ESTs 3.9 
451067 BE172186 gb:MR0*HT055g-1 1 0300-005^11 HT0559 Homo 3.8 

419348 AA236645 Hs.98274 ESTs 3.8 

60 444635 AI164268 Hs.339665 ESTs 3.8 

412140 AA219691 Ks.73625 RAB6 Interacting, kinesin-like (rabklnes 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC done RP5-1087M1 9 from 3.8 

65 433871 W02410 Hs.205555 ESTs 3.8 

445253 AI217928 Hs.144762 ESTs 33 

409542 AA503020 H5.36563 hypothetical protein FLJ2241 8 3.8 
443162 
458194 
422475 
440705 
447290 
403426 
427821 
454288 
443801 
410658 
410672 
428579 
445495 
447995 
401747 
420633 
423545 
433138
434715 
428664 
450951 
402696 
446868 
458154 
422026 
419440 
421524 
417283 
401508 
410303 
420362 
433384 
434302 
443938 
448420 
458712 
433404 
405232 
430491 
455609 
450164 
453948 
436061 
401049 
418867 
420179 
458663 
437259 
428309 
450522 
451952 
412209 
425201 
443830 
439255 
414869 
409064 
407721 
445135 
404091 
409731 
405153 
423248 
403639 
404360 
422352 
423338 



T49951 

AW383618 

AL359938 

AA904244 

AI476732 

AA470158 

BE222648 

AW206942 

AW105231 

AW794600 

NM 005755 

BE622641 

AI742618 

NM^014581 

AP000692 

AB029496 

BE005346 

AK001666 

AA018534 

NA 

AV660737 

AW816379 

U80736 

AB020689 

AA312082 

N62840 

NA 

AA324597 

U79734 

AI021992 

AA629065 

R55373 

BE623004 

AI347502 

T32982 

AL109791 
BE011668 
AI239923 
AI970797 
AI248584 
NA 

D31771 

N74530 

AV658444 

AI377755 

M97815 

AI698839 

AL120173 

AW901456 

AA352111 

A1142095 

BE164500 

AA157291 

AA062954 

Y12735 

AK000054 

NA 

AA125985 

AA380177 
NA 

AA766296 
AB007961 



Hs.9029 DKFZP4346032 protein 3.8 

Hs.265459 ESTs.Moderatefysimitarto ALU2.HUMAN A 3.8 

Hs.117313 Me!s(mouse)h(>molog3 3.8 

H&153205 ESTs 3.6 

Hs^63912 ESTs 3.8 

Target Exon 3.8 

H&98202 ESTs 3.8 

Hs.279458 ESTs, Highly similar to c380A1.1bIH.sap 3.8 

Hs.253594 ESTs 3.8 

Hs.192035 ESTs 3.8 
gbJ^C6^M0014.170300-022^05 UM0014 Homo3.8 

Hs.1 84942 G protein-coupled receptor 64 3.8 

Hs.38489 ESTs. Weakly sImBar to 138022 hypotheti 3.8 

Hs.181733 ESTs, Weakly similar to nitrtlase homolo 3.7 

Homo sapfens keratin 17 (KRT17) 3.7 

Hs.274480 odorant-blnding protein 2A 3.7 

Hs.129781 chrDmosome 21 open reading frame 5 37 

Hs.59729 semaphortnsGm2 3.7 

Hs.116410 ESTs 37 

Hs.189095 similar to SALL1 (sal (Drosophlla)-iike 3.7 

Hs,103334 ESTs 3.7 
C3002523:gil6686211|sp|Q27533|YH2KLCAEEL3.7 

Hs.135100 ESTs 37 

Hs.335018 ESTs 37 

Hs.110826 trinucleotide repeat containing 9 37 

Hs.90419 KIAA0882 protein 37 

Hs.105445 6DNF family receptor alpha 1 37 

Hs.48648 ESTs 37 

NI\/L024817:Homo sapiens hypothetical prot 3.7 

Hs.21851 Homo sapiens cDNAFLJI 2900 fis» done NT 37 

Hs.97206 huntingtin interacting protein 1 37 

Hs.124244 ESTs 37 

Hs.116301 ESTs 37 

Hs.20864 ESTs 3.7 
gbS01441282F1 NIH_MGC_72 Homo sapiens c 3.7 

Hs.107872 hypothetical protein aJ20761 3.7 

Hs.102720 ESTs 37 

NWL015832:Homo sapiens methyl<;pG bindin 37 

Hs.241559 Homo sapiens mRNA full length Insert cDN 37 
gb:CM3-BN0223-100500.177-a04 BN0223 Honw37 

Hs.30098 ESTs 3.7 

Hs.64859 ESTs 37 

Hs.190745 Homo sapiens cDNA: FU21326 fis. done C 37 

Target Exon 3.6 

Hs.89404 msh(Drosophila)homeobQxhomok)g2 3.6 

Hs.21168 ESTs 3.6 

Hs^80776 tankyrase.TRF1-inleiact!ngankyrfn-fela 3.6 

Hs.1 20695 ESTs 3.6 

HS.1836S0 cellular retlnoic add-bindlng protein 2 3.6 
gb»d31f02.x1 Soares_NFL_T_6BC„S1 Homos3.6 

Hs.301663 ESTs 3.6 
gb«C04^N1012-270300-031-c07 NN1012 Homo3.6 

gb:EST60061 Activated T-cells XX Homo sa 3.6 

ESTs 3.6 
gbrf^C4-HT0469-230300-014-e10 HT0469 Honw3.6 



Hs.143273 

Hs.21479 
Hs,141883 
Hs.38018 
Hs.1 2347 

Hs.56145 

Hs.125845 



Hs.99200 
Hs.127338 



ublnudein 1 3.6 

ESTs 3.6 

dual^pedficity tyroslne-{Y)-phosphoryt 3.6 

hypothetical protein FU20047 3.6 

Taiget Exon 3.6 

thymosin, beta, Identified in neuroblast 3.6 

Taiget Exon 3.6 

ribulo6e^-phosfihate-3-ep{merase 3.6 

ENSP00000233023*:CDNAFU12662fis.clon 3.6 
C7001385.-gil12082809|gb|AAG48618.1|AF315 3.6 

ESTs 3.6 

KIAA0492 protein 3.6 
424202 BE350295 Hs.15032 RAN binding protein 17 3.6

431750 AA514986 Hs.283705 ESTs 3.6 

439907 AA853978 Hs.124577 ESTs 3.6 

453596 AA441838 Hs.62905 hypothetical protein FU 14834 3.6 

5 406446 NA Target Exon 3,6 

418454 AA315308 Hs.195870 hypothetical protein FU14991 3.6 

434360 AW015415 Hs.127780 ESTs 3.6 

409079 W87707 Hs.82065 Interleukin 6 signal transducer (gp130, 3.6 

440132 AI687121 H5.202466 ESTs, Weakly similar to S65624 reverse! 3.6 

10 448706 AW291095 Hs.21814 interleukin 20 receptor, alpha 3.6 

440671 AW297920 H5.130054 ESTs 35 
407647 AW860158 gb:RaWJT0379-29010aO32-b04CT0379Homo35 

459023 AW968226 Hs.60798 ESTs 3.5 

402620 NIVL017646*:Homo sapiens tRNAisopentenyt 3.5 

15 417009 AA191719 Hs.3l4714 ESTs 3.5 

447048 AW393080 Hs.228320 hypothetical protein FU23537 3.5 

449978 AI806335 Hs.200829 ESTs, Weakly similar to T30171 nlnein - 3.5 

428062 AA420683 Hs.98321 hypothetical protein FU14103 35 

452909 NIVL015368 Hs.30985 pannexln 1 ' 35 

20 400610 NA Target Exon 35 

417643 W07361 Hs.22545 Homo sapiens cDNA FU1 2935 fe. done NT 35 

419335 AW960146 Hs.284137 hypothetical protein FU12888 3.5 

451992 AI805416 Hs.2l3897 ESTs 35 

443270 NM_004272 Hs.337737 Homer, neuronal immediate early gene, 1 B 35 

25 423948 AW392342 Hs.283077 centrosomal P4. 1 -associated protein; unc 35 

449424 AW448937 Hs.197030 ESTs 35 
423841 AW753967 gb:RC2-CT0304^100^11-h12CT0304Homo35 

416806 NM_000288 Hs.79993 peroxisomal bfogenesfe factor 7 35 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALU5J^UIVIAN A 35 

30 433104 AL043002 Hs.128246 ESTs, Moderately similar to unnamed prot 35 

415778 H84847 Hs.49391 hypothetical protein LOC54149 35 

413054 AW316843 Hs.66309 hypothetical protein M6C11061 35 

416636 N32536 Hs.42645 solute canier family 16 {monocarboxyllc 35 

424639 AI917494 Hs.9812 Homo sapiens cDNA FU14388 fis, clone HE 35 

35 424827 A1057094 Hs.96867 Homo sapiens cDNA: FLJ231 55 fis, done L 35 

437782 AI370876 Hs.79090 exportin 1 (CRMI. yeast, homolog) 35 

411514 AW850178 gb:lL3>CT0219-271099^22-H12CT0219Homo 35 

413783 AA314337 Hs.301547 ribosomal protein 87 35 

421106 AA877124 Hs.172844 ESTs 3.5 

40 431291 N25521 Hs.25275 Kruppel-type zinc finger protein 35 

440623 AI935016 Hs.216639 ESTs 3.5 
455838 BE145808 gb:MRO-HTQ208-101299-1 0341 1 HT0208 Homo 35 

458771 AW295151 Hs.163612 ESTs 35 

442942 AW167087 Hs.131562 ESTs 35 

45 436550 Z50158 Hs.270235 ESTs, Weakly similar to MMHUB1 lairinln b 35 

418849 AW474547 Hs.53565 Homo sapiens PIG-MmRNA for mannosyltran 35 

424420 BE614743 Hs.146688 prostaglandin E synthase 35 

430916 AW505021 Hs.88414 BTB and CNC homology l.basicleudnezl 35 

432030 A1908400 Hs.143789 ESTs 35 

50 439405 AF086224 Hs.55238 ESTs 3.5 

405917 NA C17000675:gi|7290703|gb|AAF46150.1|(AEO 3,5 

452727 AW993582 Hs.17622Q ESTs 3.5 

426320 W47595 Hs.169300 transforming growth factor, beta 2 3.4
421070 AA283185 Hs.19327 ESTs 3.4
424625 AW904466 Hs.321197 PDZ domain protein (Drosophila inaD-like 3.4
426508 BE252383 Hs.184668 SBBI31 protein 3.4
455651 BE064962 gb:RC1-BT0313-1304004]16^2 BT0313 Homo 3.4
410555 U92649 Hs.64311 a disintegrin and metalloproteinase doma 3.4
447754 AW073310 Hs.163533 Homo sapiens cDNA FLJ14142 fis, clone MA 3.4
418636 AW749855 gb:QV4-BT0534-2812994)53-c05 BT0534 Homo 3.4
404097 NA C5000242*:gi|9369379|gb|AAF87128.1|AO006 3.4
434205 AF119861 Hs.283032 hypothetical protein PRO2015 3.4
421072 AI215069 Hs.89113 ESTs 3.4
402421 NA C1001578*:gi|6759903|gb|AAF28099.1|(AF1 3.4
405248 NA Target Exon 3.4
407638 AJ404672 Hs.334483 hypothetical protein FLJ23571 3.4
403000 BE247275 Hs.151787 U5 snRNP-specific protein, 116 kD 3.4
433393 AF038564 Hs.98074 itchy (mouse homolog) E3 ubiquitin prote 3.4

432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 3.4
458747 BE618395 Hs.257391 hypothetical protein DKFZp761J1523 3.4
442082 R41823 Hs.7413 ESTs; calsyntenin-2 3.4
417974 AA210765 gbu:r90c06.r1 NCI_CGAP_GCB1 Homo sapiens 3.4
446002 AJ346468 Hs.145789 ESTs 3.4
448995 AI613276 Hs.5662 guanine nucleotide binding protein (G pr 3.4
436007 AI247716 Hs.232168 ESTs 3.4
424698 AA164366 Hs.151973 hypothetical protein FLJ23511 3.4
435202 AI971313 Hs.170204 KIAA0551 protein 3.4

410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 3.3
405460 NA Target Exon 3.3
441826 AW503603 Hs.129915 phosphotriesterase related 3.3
453472 AL037925 gb:DKFZp564M037_r1 564 (synonym: hfbr2) 3.3
447078 AW885727 Hs.301570 ESTs 3.3
441690 R81733 Hs.33106 ESTs 3.3
420092 AA814043 Hs.88045 ESTs 3.3
418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.3
408908 BE296227 Hs.250822 serine/threonine kinase 15 3.3
414737 AI160386 Hs.125087 ESTs 3.3
449650 AF055575 Hs.23838 calcium channel, voltage-dependent, L ty 3.3
418912 NM_000685 Hs.89472 angiotensin receptor 1 3.3
436405 AA150079 Hs.172932 Homo sapiens mRNA for partial 3'UTR, seq 3.3
453911 AW503857 Hs.4007 Sarcolemmal-associated protein 3.3
409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo 3.3
429548 AW138872 Hs.135288 ESTs 3.3
420807 AA280627 Hs.57846 ESTs 3.3
409695 AA296961 gb:EST112514 Adrenal gland tumor Homo sa 3.3
445189 AI936450 Hs.147482 ESTs 3.3
402892 NA Target Exon 3.3
426681 AA994896 Hs.22514 ESTs 3.3
458722 AA741545 Hs.282832 ESTs, Weakly similar to T24961 hypotheti 3.3
409430 R21945 Hs.166975 splicing factor, arginine/serine-rich 5 3.3
443194 AI954968 Hs.279009 matrix Gla protein 3.3
445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA clone 3.3
410908 AA121686 Hs.10592 ESTs 3.3
406151 NA Target Exon 3.3
436461 AW511956 Hs.293261 ESTs 3.3
411171 AW820260 gb:QV2-ST0296-150200^40-c10 ST0296 Homo 3.3
432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.3
439310 AF086120 Hs.102793 ESTs 3.3
401575 NA Target Exon 3.3
420900 AL045633 Hs.44269 ESTs 3.3
445628 AI344166 Hs.155743 ESTs 3.3
448243 AW369771 Hs.52620 integrin, beta 8 3.3
445102 AW204610 Hs.22270 ESTs 3.3
442118 AA976718 Hs.202242 ESTs 3.3
416783 AA206186 Hs.79889 monocyte to macrophage differentiation-a 3.3
435039 AW043921 Hs.130526 ESTs 3.3

451474 T70874 Hs.207636 ESTs 3.2
442559 T10213 Hs.159993 glycosyltransferase 3.2
453921 AI824009 Hs.44577 ESTs 3.2
420036 R60336 Hs.52792 Homo sapiens mRNA; cDNA DKFZp586I1823 (f 3.2
435627 W88774 Hs.118370 ESTs 3.2
411598 BE336654 Hs.70937 H3 histone family, member A 3.2
445733 AA863360 Hs.26040 ESTs, Weakly similar to fatty acid omega 3.2
410153 BE311926 Hs.15830 hypothetical protein FLJ12691 3.2
403637 NA C3001106*:gi|10047201|dbj|BAB13394.1|(A 3.2
405547 NM_018833*:Homo sapiens transporter 2, A 3.2
427878 C05766 Hs.181022 CGI^ protein 3.2
451871 AI821005 Hs.118599 ESTs 3.2
410313 R10305 Hs.185683 ESTs 3.2
416856 N27833 Hs.269028 ESTs, Weakly similar to I38022 hypotheti 3.2
449490 AI652777 Hs.197069 ESTs 3.2
450506 NM_004460 Hs.418 fibroblast activation protein, alpha 3.2
440684 AI253123 Hs.127356 ESTs, Highly similar to S21424 nestin [H 3.2
459055 N23235 Hs.30567 ESTs, Weakly similar to B34087 hypotheti 3.2
452190
430965 
405394 
424693 
454265 
437687 
428372 
414083 
411670 
416283 
437488 



452042 
421477 
438078 
446816 
419519 
404580 
447046 
457473 
429838 
459702 
400195 
417860 
417995 
422589 
435870 
440801 
426274 
423728 
439677 
452834 
431349 
417576 
430264 
418827 
410835 
426269 
405336 
437783 
440931 
455945 
430437 
405848 
455685 
406970 
409602 
423518 
425653 
426326 
433805 
437152 
448602 
452844 
407366 
408254 
424065 
416790 
420020 
426119 
426968 
457421 
453403 
454141 
426650 
450866 
407993 



H26735 
AA489732 

BE169810 

H03556 

M765917 

AK000684 

AL121282 

AW856552 

NM_005429

AA758239 

AI249368 

H38857 

Af904743 

AI015377

AB033052 

AI198719 

AA326187 

AW974903 

AW904907 

AI204995 

NA 

AW408557 

AW974175 

AA312735 

AA701327 

AA906366 

D38122 

AW891294 

R82331 

AI638627 

AA503653 

AA339449 

AA470519 

BE327311 

AW806906 

H15302 

NA 

AI683150 
AI583052 
BE160636 
AI768801 
NA 

BE066976 

M29994 

W26713 

D45027 

AI065104 

BE165753 

AA706910 

AL050027 

AI541305 

AW407181 

AF026942 

AW807227 

NM_002914

R83066 

BE295866 

W94997 

U07616 

AI117431

BE466639 

AW138413 

AA382814 

AI248013

AW135274 



Hs.91668 Homo sapiens clone PP1498 unknown mRNA 3.2
Hs.154918 ESTs 3.2
Target Exon 3.2
Hs.47557 ESTs 3.2
Hs.300949 ESTs, Weakly similar to thyroid hormone 3.2
Hs.122840 ESTs 3.2
Hs.183887 hypothetical protein FLJ22104 3.2
Hs.257785 ESTs 3.2
gb:RC1-CT0294-080100-012-a04 CT0294 Homo 3.2
Hs.79141 vascular endothelial growth factor C 3.2
Hs.180330 ESTs 3.2
Hs.98556 ESTs 3.2
Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.2
Hs.104650 hypothetical protein FLJ10292 3.2
Hs.131693 ESTs 3.2
Hs.22151 KIAA1226 protein 3.2
Hs.176376 ESTs 3.2
NM_014112*:Homo sapiens trichorhinophala 3.2
Hs.17170 G protein-coupled receptor 4 3.2
Hs.291231 ESTs 3.1
Hs.30732 hypothetical protein FLJ13409; KIAA1711 3.1
gb:an03G03j(1 Stratagene schizo brain S1 3.1
NM_007057*:Homo sapiens ZW10 interactor 3.1
Hs.235498 hypothetical protein FLJ14075 3.1
Hs.188751 ESTs, Weakly similar to MAPB_HUMAN MICRO 3.1
Hs.30512 Homo sapiens mRNA for KIAA0556 protein, 3.1
Hs.17949 ESTs 3.1
Hs.190535 ESTs 3.1
Hs.2007 tumor necrosis factor (ligand) superfami 3.1
Hs.132136 solute carrier family 4, sodium bicarbon 3.1
Hs.164599 ESTs 3.1
Hs.105685 KIAA1688 protein 3.1
Hs.156942 ESTs, Moderately similar to ALU2_HUMAN A 3.1
Hs.82285 phosphoribosylglycinamide formyltransfer 3.1
gb:nc71f10.s1 NCI_CGAP_Pr1 Homo sapiens 3.1
Hs.47166 HT021 3.1
gb:QV4-ST0023-160400-172-d12 ST0023 Homo 3.1
Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1
Target Exon 3.1
Hs.201550 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.1
Hs.270058 ESTs 3.1
gb:PM1-HT0422-291299^2^08 HT0422 Homo 3.1
Hs.169943 Homo sapiens cDNA FLJ13569 fis, clone PL 3.1
Target Exon 3.1
gb:PM0-BT0340-211299-O0i<:12 BT0340 Homo 3.1
gb:Human alpha-I spectrin gene, exon 12. 3.1
Hs.256972 ESTs 3.1
Hs.129732 R3H domain (binds single-stranded nuclei 3.1
Hs.249718 ESTs, Weakly similar to A46010 X-linked 3.1
Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, 3.1
Hs.112742 ESTs 3.1
gb:Homo sapiens mRNA; cDNA DKFZp566C0324 3.1
Hs.48778 niban protein 3.1
Hs.218377 Homo sapiens cDNA FLJ11927 fis, clone HE 3.1
gb:Homo sapiens dg33 mRNA, partial sequ 3.1
gb:MR4-ST0062-180200^1^10 ST0062 Homo 3.1
Hs.139226 replication factor C (activator 1) 2 (40 3.1
Hs.7043 succinate-CoA ligase, GDP-forming, alpha 3.1
Hs.94382 adenosine kinase 3.1
Hs.189917 ESTs 3.1
Hs.173034 amphiphysin (Stiff-Mann syndrome with br 3.1
Hs.112165 Homo sapiens cDNA FLJ12198 fis, clone MA 3.1
Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 3.1
Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1
gb:EST96097 Testis I Homo sapiens cDNA 5 3.1
Hs.106532 ESTs, Weakly similar to I38588 reverse t 3.1
Hs.12433 ESTs 3.1
446466 H38026 Hs.308 arrestin 3, retinal (X-arrestin) 3.1

457888 BE219794 Hs^93471 ESTs 3.1 

420058 AK001423 Hs.94694 Homo sapiens cDNA FLJ10561 fis, clone NT 3.0
409248 AB033035 Hs.51965 KIAA1209 protein 3.0
452747 BE153855 Hs.61450 Ig superfamily receptor LNIR 3.0
418926 AA232658 Hs.105794 UDP-glucose:glycoprotein glucosyltransfe 3.0
419346 AI830417 Hs.44143 polybromo 1 3.0
429826 N93266 Hs.40747 ESTs 3.0
435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0
420139 NM_005357 Hs.95351 lipase, hormone-sensitive 3.0
405609 NA ENSP00000241065*:CDNA 3.0
404274 NM_002944*:Homo sapiens v-ros avian UR2 3.0
449777 AI971362 Hs.231945 ESTs 3.0
415459 H07118 Hs.6099 ESTs 3.0
415245 N59650 Hs.27252 ESTs 3.0
406291 NA Target Exon 3.0
414210 BE383592 gb:601297871F1 NIH_MGC_19 Homo sapiens c 3.0
432055 AW972359 Hs.293334 ESTs 3.0
442246 AI791958 Hs.129115 ESTs 3.0
451353 N21043 Hs.42932 ESTs 3.0
451177 AJ969716 Hs.13034 ESTs 3.0
418026 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 3.0
401326 NA C10000447*:gi|1168375|sp|P43467|AGA1_PED 3.0
409920 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.0
432887 AI926047 Hs.162859 ESTs 3.0
411789 AF245505 Hs.72157 DKFZP564I1922 protein 3.0
401045 C11001883*:gi|5753278|ref|NP_033938.1| c 3.0
432162 AA584062 Hs.272798 hypothetical protein FLJ20413 3.0
434627 AI221894 Hs.39311 ESTs 3.0
442611 BE077155 Hs.177537 hypothetical protein DKFZp761B1514 3.0
425477 AW958879 Hs.270535 ESTs 3.0
457183 H91882 Hs.118569 Dvl-binding protein IDAX (inhibition of 3.0
433014 NM_014711 Hs.279912 KIAA0419 gene product 3.0
415542 R13474 Hs.290263 ESTs, Weakly similar to I38022 hypotheti 3.0
416173 R52782 gb:yg99d09.r1 Soares infant brain 1NIB H 3.0
408155 AB014528 Hs.43133 KIAA0628 gene product 3.0
453913 AW004683 Hs.78934 mutS (E. coli) homolog 2 (colon cancer, 3.0
435495 AI754212 Hs.21951 Homo sapiens Xq pseudoautosomal region; 3.0
423629 AW021173 Hs.18612 Homo sapiens cDNA: FLJ21909 fis, clone H 3.0
411836 AW901879 Hs.314453 ESTs 3.0
415030 D31118 Hs.191735 hypothetical protein MGC10520 3.0
419606 AW294795 Hs.198529 ESTs 3.0
440310 AA878939 Hs.125406 ESTs 3.0
443608 AI375957 Hs.289074 F-box only protein 22 3.0
414140 AA281279 Hs.23317 hypothetical protein FLJ14681 3.0
444761 NM_014400 Hs.11950 GPI-anchored metastasis-associated prote 3.0
435393 AA701259 Hs.189299 ESTs 3.0
454071 AI041793 Hs.42502 ESTs 3.0
446922 BE175605 gb:RC5-HT0580-100500-022-H07 HT0580 Homo 3.0
448062 AW295923 Hs.255472 KIAA1843 protein 3.0
427711 M31659 Hs.180408 solute carrier family 25 (mitochondrial 3.0
450382 AA397658 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL 3.0

424865 W01938 Hs.337243 ESTs, Weakly similar to ALU7_HUMAN ALU S 2.9
433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 2.9
420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.9
445625 BE246743 Hs.288529 hypothetical protein FLJ22635 2.9
403677 NA C4001462:gi|4887715|gb|AAA79329.2| (L088 2.9
411093 BE067650 gb:MR4-BT0358^90300^03-601 BT0358 Homo 2.9
435255 W87434 Hs.106015 ESTs, Moderately similar to ALU1_HUMAN A 2.9
443127 BE568102 Hs.160312 mitochondrial ribosomal protein S16 2.9
448104 AI674818 Hs.316433 Homo sapiens cDNA FLJ11375 fis, clone HE 2.9
427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9
430414 AW365665 Hs.120388 ESTs 2.9
423600 AI633559 Hs.310359 ESTs 2.9
458562 N34128 Hs.145268 ESTs 2.9
402109 NA Target Exon 2.9
429629 BE501732 Hs.30622 Homo sapiens cDNA FLJ13010 fis, clone NT 2.9




442295 


AI827248 


419752 


AA249573 


404721 




445107 


AI208121 


401967 




430566 


AA481282 


444517 


AI939339 


445563 


AW873606 


427691 


AW194426 


456561 


AI868634 


401458 




421039 


NM_003478


459504 


BE514127 


424962 


NM_012288


409617 


BE003760 


416931 


D45371 


413221 


BE161151


409732 


NM_016122


433687 


AA743991 


434340 


AI193043 


454529 


Z45439 


421379 


Y16221 


452291 


AF015592 


457402 


AW452648 


449051 


AW961400 


408761 


AA057264 


401093 




435061 


AI651474


447985 


AI681475 


449340 


AW235786 


426384 


AI472078


411905 


BE265067 


405953 


NA 


420854 


AW296927 


434265 


AA846811 


428365 


AA295331 


442861 


AA243837 


448337 


AW206453 


452554 


AW452434 


412248 


BE176480 


449450 


AL039852 


434757 


AI038997 


409038 


T97490 


454545 


AW806899 


439842 


AI910896 


428479 


Y00272 


424800 


AL035568 


411086 


BE070800 


400250 


NA 


449168 


NM_016206


456482 


AA485224 


426044 


AA502490 


431854 


AA383550 


405873 


NA 


440400 


AA994364 


458265 


AI075375 


413708 


BE158791


423739 


AA398155 


424408 


AI754813 


453096 


AW294631 


421825 


AA298758 


417742 


R64719 


402765 




444378 


R41339 


419172 


AW338625 


401497 




402376 





Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 2.9
Hs.152618 ESTs, Moderately similar to ZN91_HUMAN Z 2.9
NM_005596*:Homo sapiens nuclear factor I 2.9
Hs.147313 ESTs, Weakly similar to I38022 hypotheti 2.9
NM_002737*:Homo sapiens protein kinase C 2.9
Hs.190149 ESTs 2.9
Hs.146883 ESTs 2.9
Hs.149006 ESTs 2.9
Hs.20726 ESTs 2.9
Hs.246358 ESTs, Weakly similar to T32250 hypotheti 2.9
Target Exon 2.9
Hs.101299 cullin 5 2.9
gb:601315974F1 NIH_MGC_8 Homo sapiens cD 2.9
Hs.153954 Tl^lVl-like protein 2.9
Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 2.9
Hs.80485 adipose most abundant gene transcript 1 2.9
gb:PI\Aa-HT0425.141299^1'F08 HT0425 Homo 2.9
Hs.56148 NY-REN-58 antigen 2.9
gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 2.9
Hs.128685 ESTs, Weakly similar to T17226 hypotheti 2.9
Hs.270425 ESTs 2.9
Hs.103982 small inducible cytokine subfamily B (Cy 2.9
Hs.28853 CDC7 (cell division cycle 7, S. cerevisi 2.9
Hs.149342 activation-induced cytidine deaminase 2.9
Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.9
Hs.238936 ESTs, Weakly similar to (defline not ava 2.9
C12000586*:gi|6330167|dbj|BAA86477.1| (A 2.9
Hs.163944 ESTs 2.9
Hs.200949 ESTs 2.9
Hs.195359 hypothetical protein MGC10954 2.9
Hs.303662 ESTs 2.9
gb:601193893F1 NIH_MGC_7 Homo sapiens cD 2.9
Target Exon 2.8
gb:UI-H-BWa«jcK>07-0-UI.s1 NCI_CGAP_Su 2.8
Hs.130554 Homo sapiens cDNA: FLJ23089 fis, clone L 2.8
Hs.183861 Homo sapiens cDNA FLJ20042 fis, clone CO 2.8
Hs.57787 ESTs 2.8
Hs.3782 ESTs 2.8
Hs.58006 ESTs, Weakly similar to ALU5_HUMAN ALU S 2.8
gb:RC3-HT0585-160300-022-c02 HT0585 Homo 2.8
Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 2.8
Hs.132921 ESTs 2.8
Hs.50002 small inducible cytokine subfamily A (Cy 2.8
gb:QV4-ST0023-160400-172-c12 ST0023 Homo 2.8
Hs.132413 ESTs 2.8
Hs.184572 cell division cycle 2, G1 to S and G2 to 2.8
Hs.153203 MyoD family inhibitor 2.8
gb:RC3-BT0502-251199-011-ca7 BT0502 Homo 2.8
Eos Control 2.8
colon carcinoma related protein 2.8
gb:aa41b12.s1 NCI_CGAP_GCB1 Homo sapiens 2.8



Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs.146428 
Hs.11325 
Hs.183747 



Hs.12569 
Hs.22120 



ESTs 2.8
polymerase (DNA directed) iota 2.8
Target Exon 2.8
ESTs, Weakly similar to T25472 hypotheti 2.8
ESTs, Weakly similar to IRX2_HUMAN IROQU 2.8
gb:IL2-HT0397-091299-025-D02 HT0397 Homo 2.8
ESTs 2.8
collagen, type V, alpha 1 2.8
ESTs 2.8
ESTs, Moderately similar to CA13_HUMAN C 2.8
gb:EST22d11 WATM1 Homo sapiens cDNA clon 2.8
C1003621*:gi|12407405|gb|AAG53491.1|AF22 2.8
ESTs 2.8
ESTs 2.8
Target Exon 2.8
C19000763*:gi|1363912|pir||JC4296 ring f 2.8
405041 NA C3001706*:gi|1345652|sp|P15989|CA36_CHIC 2.8

408758 NM_003686 Hs.47504 exonuclease 1 2.8
431917 D16181 Hs.2868 peripheral myelin protein 2 2.8
437583 AA761190 Hs.244627 ESTs 2.8
453737 AA744852 Hs.194293 ESTs, Weakly similar to I54374 gene NF2 2.8
458094 AF086325 gb:Homo sapiens full length insert cDNA 2.8
401283 NA Target Exon 2.8
410784 AW803201 gb:IL2-UM0077-070500-080-E06 UM0077 Homo 2.8
417601 NM_014735 Hs.82292 KIAA0215 gene product 2.8
418236 AW994005 Hs.337534 ESTs 2.8
435532 AW291488 Hs.117305 Homo sapiens, clone IMAGE:3682908, mRNA 2.8
454714 AW815098 gb:QV4-ST0212-091199-023-f10 ST0212 Homo 2.8
418629 BE247550 Hs.86859 growth factor receptor-bound protein 7 2.8
442101 AI651930 Hs.135684 ESTs 2.8
405080 AK000375 Hs.88820 HDCMC28P protein 2.8
414661 T97401 Hs.21929 ESTs 2.8
425589 AI650633 Hs.143688 Homo sapiens cDNA: FLJ23031 fis, clone L 2.8

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 2.7
428824 W23624 Hs.173059 ESTs 2.7
421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 2.7
414596 BE386870 gb:601275271F1 NIH JAGCJO Homo sapiens c 2.7
440868 R79707 Hs.263339 ESTs, Moderately similar to I38022 hypot 2.7
452943 BE247449 Hs.31082 hypothetical protein FLJ10525 2.7
443772 AV546449 Hs.282872 ESTs 2.7
432361 AI378552 Hs.159585 ESTs 2.7
430375 AW371048 Hs.93758 H4 histone family, member H 2.7
406504 NA C5000558:gi|4504675|ref|NP_002175.1|inl 2.7
423279 AW959861 Hs.290943 ESTs 2.7
424871 NM_004525 Hs.153595 low density lipoprotein-related protein 2.7
453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 2.7
423961 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 2.7
422156 N34524 gb:yy56d10.s1 Soares_multiple_sclerosis_ 2.7
415752 BE314524 Hs.78776 putative transmembrane protein 2.7
419987 NM_005014 Hs.94070 osteomodulin 2.7
406182 NA Target Exon 2.7
416495 X59970 Hs.79350 RYK receptor-like tyrosine kinase 2.7
444701 AI916512 Hs.198394 ESTs 2.7
408171 AA301228 Hs.43299 hypothetical protein FLJ12890 2.7
430153 AW968128 Hs.336679 ESTs 2.7
413383 AA128978 Hs.154706 hypothetical protein FLJ14917 2.7
414831 M31158 Hs.77439 protein kinase, cAMP-dependent, regulato 2.7
413278 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 2.7
433132 AB026254 Hs.284245 hypothetical protein IMPACT 2.7
437030 AA742577 Hs.303781 EST 2.7
439031 AF075079 gb:Homo sapiens full length insert cDNA 2.7
449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cel 2.7
406153 Target Exon 2.7
406625 Y13647 Hs.119597 stearoyl-CoA desaturase (delta-9-desatur 2.7
444698 AI188139 Hs.147050 ESTs 2.7
432328 AI572739 Hs.195471 6-phosphofructo-2-kinase/fructose-2,6-bi 2.7
429628 H09604 Hs.13268 ESTs 2.7
420149 AA255920 Hs.88095 ESTs 2.7
431207 AA495925 Hs.9394 ESTs 2.7
438394 BE379623 Hs.27693 peptidylprolyl isomerase (cyclophilin)-l 2.7
443304 AI050073 Hs.135338 ESTs 2.7
427660 AI741320 Hs.114121 Homo sapiens cDNA: FLJ23228 fis, clone C 2.7
408460 AA054726 Hs.285574 ESTs 2.7
416515 N91716 Hs.194140 ESTs, Weakly similar to I38022 hypotheti 2.7
429922 Z97630 Hs.225117 H1 histone family, member 0 2.7
418203 X54942 Hs.83758 CDC28 protein kinase 2 2.7
439509 AF086332 Hs.58314 ESTs 2.7
402184 NA ENSP00000245238*:CDNA FLJ10922 fis, clon 2.7
450496 AW449251 Hs.257131 ESTs 2.7
451963 AI82M40 Hs.224952 ESTs 2.7
457938 AI373638 Hs.133900 ESTs 2.7
441541 AA938663 Hs.199828 ESTs 2.7
441111 AI806867 Hs.126594 ESTs 2.7
423020 AA383092 Hs.1608 replication protein A3 (14kD) 2.7

445354 AV653485 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 2.7
427961 AW293165 Hs.143134 ESTs 2.7
410889 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 2.7
445234 AW137636 Hs.146059 ESTs 2.7
413903 AA496493 Hs.23136 ESTs 2.7
406069 NA Target Exon 2.7
447410 AI470235 Hs.172698 EST 2.7
401256 NA NM_024089*:Homo sapiens hypothetical pro 2.7
415139 AW975942 Hs.48524 ESTs 2.7
420218 AW958037 Hs.286 ribosomal protein L4 2.7
455511 BE144762 gb:CM0+IT0160^109W65^4 HT0180 Homo 2.7
438825 BE327427 Hs.79953 ESTs 2,5
409564 AA045857 Hs.54943 fracture callus 1 (rat) homolog 2.6
452837 AL121053 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT 2.6
434876 AF160477 Hs.61460 Ig superfamily receptor LNIR 2.6
421585 AK001122 Hs.105859 hypothetical protein FLJ10260 2.6
453279 AW893940 Hs.59698 ESTs 2.6
430785 Z30201 gb:HHEA22G Atrium cDNA library Human hea 2.6
456986 D38299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2.6
433068 NM_006456 Hs.288215 sialyltransferase 2.6
421952 AA300900 Hs.98849 ESTs, Moderately similar to AF161511 1 H 2.6
429208 AA447990 Hs.190478 ESTs 2.6
430733 AW975920 Hs.283361 ESTs 2.6
441720 AI346487 Hs.28739 ESTs 2.6
418986 AI123555 Hs.81796 ESTs 2.6
432481 AW451645 Hs.151504 Homo sapiens cDNA FLJ11973 fis, clone HE 2.6
434338 AW754311 gb:CM1-CT0337-141299-068-ID7 CT0337 Homo 2.6
417061 AI675944 Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 2.6
410530 M25809 Hs.64173 ATPase, H transporting, lysosomal (vacuo 2.6
456672 AK002016 Hs.114727 Homo sapiens, clone MGC:16327, mRNA, com 2.6
425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II 2.6
408868 AW292286 Hs.255058 ESTs 2.6
451531 AA018311 Hs.114762 ESTs 2.6

405822 Target Exon 2.6
418301 AW976201 Hs.53913 hypothetical protein FLJ10252 2.6
417315 AI080042 Hs.336901 ribosomal protein S24 2.6
434699 AA643687 Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HE 2.6
443204 AW205878 Hs.29643 Homo sapiens cDNA FUISIOS fis, clone ^^' 2.6
405638 Target Exon 2.6
452542 AW812256 gb:RC0-ST0174-191099^1-a07 ST0174 Homo 2.6
403943 C5000355:gi|4503225|ref|NP_000765.1|cyt 2.6
404535 Z25884 Hs.121483 chloride channel 1, skeletal muscle (Th 2.6
402800 NA Target Exon 2.6
449144 AI989503 Hs.233405 ESTs 2.6
454934 AW846080 Hs.314324 ESTs 2.6
424717 H03754 Hs.152213 wingless-type MMTV integration site fami 2.6
426303 AW974476 Hs.183601 regulator of G-protein signalling 16 2.6
427970 AA418187 Hs.330515 ESTs 2.6
450638 AK001826 Hs.25245 hypothetical protein FLJ11269 2.6
453034 BE246010 Hs.271468 Homo sapiens mRNA for FLJ00038 protein, 2.6
455097 AW855802 gb:RC1-CT0279-170200^23Kl08 CT0279 Homo 2.6
427317 AB028955 Hs.175780 KIAA1032 protein 2.6
408875 NM_015434 Hs.46604 DKFZP434B166 protein 2.6
427510 Z47542 Hs.179312 small nuclear RNA activating complex, po 2.6
423201 NM_000163 Hs.125180 growth hormone receptor 2.6
406271 Target Exon 2.6
442696 BE566962 Hs.7063 Homo sapiens cDNA: FLJ20913 fis, clone A 2.6
454018 AW016892 Hs.100855 ESTs 2.6
435420 AI928513 Hs.59203 ESTs 2.6
434398 AA121098 Hs.3838 serum-inducible kinase 2.6
455708 BE069326 gb:QV3-BT0381-170100^60-g03 BT0381 Homo 2.6
439347 W24320 Hs.102941 Homo sapiens cDNA: FLJ21531 fis, clone C 2.6
407523 X64984 gb:H.sapiens mRNA HTPCRX10 for olfactory 2.6
425101 AA830431 Hs.180811 ESTs 2.6
435153 AA668763 Hs.291939 ESTs 2.6
409139 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 2.6
455100
414612 
440283 
423025 
431473 
404440 
403388 
403775 
405037 
407447 
420952 
435447 
440202 
445854 
421247 
414870 
457411 
424676 
404443 
452268 
430832 
444779 
408633 
459089 
447197 
454111 
411165 
406922 
456045 
413111 
423123 
432201 
427032 
445417 
422225 
428330 
410011 
426310 
419589 
437770 
442818 
414251 
428301 
431956 
455732 
458624 
428257 
418054 
458652 
458012 
422996 
410804 
457211 
440029 
448141 
409163 
431385 
414568 
439963 
455935 
425025 
416589 
404826 
422938 
421991 
417404 
448516 



BE160198 
BE274552 
AI732892 
AA831267 
AA625686 

NA 
NA 
NA 

AF290544 

AA282067 

AI672932 

AW516211 

AI702d85 

BE391727 

N72264 

AW085961 

Y08565 

NM_003512 

AI073913

AI192105

AW963372 

F13036 

R36075 

AW081681 

NM_000169



HS2943 

BE065837 

NM_012247 

AI538613 

AF012023 

AK001058 

BE245652 

L22524 

AB020641 

NM_000909 

AW973708 

AA767881 

AK001741 

AL042306 

AW628666 

AK002032 

BE080908 

AI362790 

BE394723 

NM_002318

AW375610 

AI424899

BE091089 

U64820 

AW972565 

AW089705 

AI471598 

AA065081 

BE17d536 

AF109298 

AW247529 

BE158687 

AW953168 

AA652687 

NM_001809
NM_014918
NM_007350
AW698595 



gb:QV1-HT0413-010200^59-h03 HT0413 Homo 2.6
Hs.76578 protein inhibitor of activated STAT3 2.6
Hs.190489 ESTs 2.6
Hs.12244 hypothetical protein FLJ20097 2.6
Hs.321176 ESTs, Weakly similar to S65824 reverse t 2.6
NM_021048:Homo sapiens melanoma antigen, 2.6
C3001398*:gi|12248917|dbj|BAB20375.1| (A 2.6
Target Exon 2.6
NM_021628*:Homo sapiens arachidonate lip 2.6
gb:Homo sapiens aminopeptidase mRNA, par 2.6
Hs.88972 ESTs, Moderately similar to A46010 X-lin 2.6
gb:wm72e03.x1 NCI_CGAP_Ut2 Homo sapiens 2.6
Hs.125300 ring finger protein 21, interferon-respo 2.6
H&14556d ESTs 2.6
Hs.102910 general transcription factor IIH, polype 2.6
Hs.300670 KIAA1204 protein 2.6
Hs.130093 ESTs 2.6
Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:polyp 2.6
C8001428*:gi|6572242|emb|CAB62951.1| (Z9 2.6
Hs.28777 H2A histone family, member L 2.6
Hs.100686 ESTs, Weakly similar to JE0350 Anterior 2.6
Hs.147170 ESTs 2.6
Hs.46677 PRO2000 protein 2.6
Hs.27373 Homo sapiens mRNA; cDNA DKFZp564O1763 (f 2.6
gb:yh88b01.s1 Soares placenta Nb2HP Homo 2.6
Hs.259064 ESTs, Weakly similar to T426e9 hypotheti 2.6
Hs.69089 galactosidase, alpha 2.6
gb:stearoyl-CoA desaturase [human, adipo 2.6
Hs.154188 ESTs 2.6
gb:RC2-BT0318-110100^12-g12 BT0318 Homo 2.6
Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human selen 2.6

Hs.298241 Transmembrane protease, serine 3 2.5
Hs.173274 integrin cytoplasmic domain-associated p 2.5
Hs.12580 Homo sapiens cDNA FLJ10196 fis, clone HE 2.5
Hs.118281 zinc finger protein 266 2.5
Hs.2256 matrix metalloproteinase 7 (MMP7; uterin 2.5
Hs.57856 PFTAIRE protein kinase 1 2.5
Hs.169266 neuropeptide Y receptor Y1 2.5
Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 2.5
Hs.122897 ESTs 2.5
Hs.8739 hypothetical protein FLJ10879 2.5
Hs.97689 VASA protein 2.5
Hs.98440 ESTs, Weakly similar to I38022 hypotheti 2.5
Hs.272245 Homo sapiens cDNA FLJ11170 fis, clone PL 2.5
gb:QV1-BT0631-280200-084-h07 BT0631 Homo 2.5
Hs.278639 KIAA1684 protein; likely homolog of mous 2.5
Hs.275243 S100 calcium-binding protein A6 (calcyci 2.5
Hs.83354 lysyl oxidase-like 2 2.5
Hs.117102 hypothetical protein FLJ13046 similar to 2.5
Hs.188211 ESTs 2.5
gb:PM4-BT0724-13040(M)06K«7 BT0724 Homo 2.5
Hs.66521 Machado-Joseph disease (spinocerebellar 2.5
Hs.32399 ESTs, Weakly similar to S51797 vasodilat 2.5
Hs.293711 ESTs, Weakly similar to S64329 probable 2.5
Hs.197531 ESTs 2.5
gb:zm13a03.s1 Stratagene pancreas (93720 2.5
Hs.11090 membrane-spanning 4-domains, subfamily A 2.5
Hs.118258 prostate cancer associated protein 1 2.5
Hs.6793 platelet-activating factor acetylhydrola 2.5
gb:CM0-HT0395-280100-16W)09 HT0395 Homo 2.5
Hs.12407 ESTs 2.5
Hs.96151 Human DNA sequence from clone RP5-1103G7 2.5
Target Exon 2.5
Hs.1594 centromere protein A (17kD) 2.5
Hs.110488 KIAA0990 protein 2.5
Hs.82101 pleckstrin homology-like domain, family 2.5
gb:RC1-NN0073-260400-011-g09 NN0073 Homo 2.5
403356 NA ENSP00000251525*:Hypothetical protein KI 2.5

404983 ENSP00000252242*:Keratin, type II cytosk 2.5
418282 AA215535 Hs.98133 ESTs 2.5
427409 AW467143 Hs.135411 actin related protein 2.5
431806 AF186114 Hs.270737 tumor necrosis factor (ligand) superfami 2.5
443367 AW071349 Hs.215937 ESTs 2.5
421246 AW582962 Hs.102897 CGI-47 protein 2.5
439217 AF086041 Hs.42975 ESTs 2.5
400925 Target Exon 2.5
404552 NA ENSP00000220888*:ZINC FINGER TRANSCRIPTI 2.5
417168 AL133117 Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1121 (f 2.5
418841 NM_002332 Hs.89137 low density lipoprotein-related protein 2.5
426853 U32974 Hs.172777 baculoviral IAP repeat-containing 4 2.5
427738 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 2.5
457384 AA501760 Hs.15806 Homo sapiens mRNA; cDNA DKFZp434H2019 (f 2.5
447128 AI271898 Hs.164866 cyclin K 2.5
454693 AW813428 gb:MR3-ST0192^10200.210<05 ST0192 Homo 2.5
434657 AA641876 Hs.191840 ESTs 2.5
402077 NA Target Exon 2.5
400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10; str 2.5
409723 AW885757 Hs.257862 ESTs 2.5
447020 T27308 Hs.16986 hypothetical protein FLJ11046 2.5
455068 AI807894 Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 2.5
431232 AI024353 Hs.131755 hypothetical protein FLJ14298 2.5
408938 AA059013 Hs.22607 ESTs 2.5
411571 AA122393 Hs.70811 hypothetical protein FLJ20516 2.5
426504 AW162919 Hs.170160 RAB2, member RAS oncogene family-like 2.5
428248 AI126772 Hs.40479 ESTs 2.5
408813 AI580090 Hs.48295 RNA helicase family 2.5
423504 N80077 Hs.24792 chromosome 12 open reading frame 5 2.5
425441 AA449644 Hs.193063 Homo sapiens cDNA FLJ14201 fis, clone NT 2.5
443066 AW297921 Hs.255703 ESTs 2.5
443556 AA256769 Hs.94949 methylmalonyl-CoA epimerase 2.5

428943 AW086180 Hs.37636 ESTs, Weakly similar to KIAA1392 protein 2.5
425320 U29344 Hs.83190 fatty acid synthase 2.5
430388 AA356923 Hs.240770 nuclear cap binding protein subunit 2, 2 2.5
423242 AL039402 Hs.125783 DEME-6 protein 2.5
416241 N52639 Hs.32683 ESTs 2.5
440244 AI743977 Hs.205144 ESTs 2.5
409239 AA740875 Hs.44307 ESTs, Moderately similar to I38022 hypot 2.5
452464 AW500507 Hs.1d2619 KIAA1600 protein 2.5
410718 AI920783 Hs.191435 ESTs 2.5
406877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 2.5
445150 AI446747 Hs.338704 olfactory receptor, family 7, subfamily 2.5
407756 AA116021 Hs.38260 ubiquitin specific protease 18 2.5
407633 NM_007069 Hs.37189 similar to rat HREV107 2.5
449754 H00820 Hs.30977 ESTs, Weakly similar to B34087 hypotheti 2.5
419316 AA236255 Hs.298419 ESTs 2.5
429118 H20669 Hs.35406 ESTs, Highly similar to unnamed protein 2.5
440331 AL046412 Hs.202151 ESTs 2.5
449344 AI640355 Hs.312691 ESTs 2.5
459006 AW298631 Hs.27721 Wolf-Hirschhorn syndrome candidate 1-lik 2.5
423165 AI937547 Hs.124915 hypothetical protein MGC2601 2.5
411337 AW837349 gb:QV2-LT0038-270300-108Kf12 LT0038 Homo 2.5
438290 AA843719 Hs.122341 ESTs 2.5
406414 C5000506*:gi|124941|sp|P18614|ITA1_RAT I 2.5
424498 AB033043 Hs.149377 hypothetical protein DKFZp761L0424 2.5
443464 BE548446 Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 2.5
424856 AA347746 Hs.9521 ESTs, Weakly similar to ZN43_HUMAN ZINC 2.5
440304 BE159984 Hs.125395 ESTs 2.5
409045 AA635062 Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f 2.5
422548 D86983 Hs.118893 Melanoma associated gene 2.5
428819 AL135623 Hs.193914 KIAA0575 gene product 2.5
412520 AA442324 Hs.795 H2A histone family, member O 2.5
430602 D13752 Hs.184927 cytochrome P450, subfamily XIB (steroid 2.5
408031 AA081395 Hs.42173 Homo sapiens cDNA FLJ10366 fis, clone NT 2.5
403133 Target Exon 2.5
413189 BE070231 gb:QV4-BT0407-260100-087-f12 BT0407 Homo 2.5

400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19. par& 2.5 

435509 AI458679 Hs.181915 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs 2.5 






TABLE 19A 



Table 19A shows the accession numbers for those pkeys lacking UniGene IDs in Table 19.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number
Accession: Genbank accession numbers 
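
The row layout described above (one Pkey, one CAT number, then the Genbank accessions of the cluster, possibly wrapped onto continuation lines) can be read into a lookup structure. The following Python sketch is illustrative only and is not part of the original disclosure; the function name `parse_table_19a`, the assumption that a Pkey is a six-digit number, and the assumption that a CAT number is written as digits, a separator, and digits are all hypothetical choices made for this example.

```python
import re
from typing import Dict, List, Tuple

# Assumed row layout: "<Pkey> <CAT number> <accession> <accession> ...".
# A line that does not begin with a six-digit Pkey is treated as a
# continuation of the previous row's accession list.
ROW_START = re.compile(r"^(\d{6})\s+(\d+[._]\d+)\s+(.*)$")


def parse_table_19a(lines: List[str]) -> Dict[str, Tuple[str, List[str]]]:
    """Map each Pkey to (CAT number, list of accession numbers)."""
    table: Dict[str, Tuple[str, List[str]]] = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank separator lines
        match = ROW_START.match(line)
        if match:
            current = match.group(1)
            table[current] = (match.group(2), match.group(3).split())
        elif current is not None:
            # Wrapped accession list: append to the row in progress.
            table[current][1].extend(line.split())
    return table


sample = [
    "410784 1221005_1 AW803201 BE079700 BE062940",
    "",
    "409163 110418_1 AA065081 AA075017",
    "AA064704 AA082878",
]
clusters = parse_table_19a(sample)
```

With such a mapping, a Table 19 row whose UniGene column is empty can be resolved to its cluster by looking up its Pkey, for example `clusters["410784"]`.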



Pkey CAT number Accessions 

407647 1007366.1 AW860158 AW862385 AW86015g AW682366 AW862341 AW62186gAW6218g3AW062660 AW062656 
407980 103087_1 AA046309 AI263500AA046397 

408254 1049346_1 AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AWBOZOgS
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807166 AW807501
AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141
AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW807114 AW807518 AW807199 AW807211
AW807498 AW807086 AW807492 AW807218 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784
AW807037 AW807128 AW807080 AW807118 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103
AW845870 AW177099 AW177101 AW807528 AW607336 AW607038 AW177100 AW807411 AW807088 AW845665
AW807226 AW807517 AW607397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521
AW807488 AW807385 AW807356 AW807223 AW807155

409163 110418_1 AA065081 AA075017 AA084791 AA071015 AA061560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457
AA064704 AA082878 AA075742 AA069182
409695 114876_1 AA296961 AA296889 AA076945 AA077528 AA077497
410534 1207247_1 AW905138 AW753008 R13818 Z43519
410672 1214882_1 AW794600 AW794730
410784 1221005_1 AW803201 BE079700 BE062940
410785 1221055_1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
410835 1223785_1 AW806906 AW806915 AW865460 AW866475 AW866462 AW866448 AW866372 AW866604
411050 1230330_1 AW814902 BE156656 BE156667 BE156590 BE156441 BE156447
411086 1231500_1 BE070800 AW875226 BE149115
411093 1231970_1 BE067650 AW817053
411111 1232669_1 AW818127 AW818161 R09719
411171 1234393_1 AW820260 AW820332 R94406
411337 1239217_1 AW837349 AW837355 AW882717
411514 1248638_1 AW850178 AW850233 AW850445 AW850446
411670 1253680_1 AW856552 AW661101 AW856574 AW861099 AW861100 AW856573 AW856576 AW856562
411905 1265181_1 BE265067 BE264978 AW875420
412102 1277395_1 H56435 H56572 AW892929
412209 1283610_1 AW901456 AW901450 AW901441
412248 1285000_1 BE176480 AW903298 AW903313
413043 1346556_1 BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678
413111 1349546_1 BE065837 BE065805 BE065799 BE065818 BE065839 BE065831 BE065894 BE065789 BE065792
413189 1352723_1 BE070231 BE070229 BE070255
413221 1353887_1 BE161151 BE162495 BE161002 BE072205 BE160989 BE162482
413499 1373910_1 BE144884 H97942
413708 1384140_1 BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158685
414210 1426051_1 BE383592 BE261671
414598 1465004_1 BE386870 Z41986 H08501
414605 1465790_-1 BE390440
415747 155189_1 AA381209 AA381245 AA167683
416173 1574973_1 R52782 R17313 H24192 R19876
417742 1696282_1 R64719 Z44680 R12451
417974 171237_1 AA210765 T95700 H94407
418636 177402_1 AW749855 AA225995 AW750208 AW750206
419536 185688_1 AA603305 AA244095 AA244183
420854 197072_1 AW296927 AI684514 AI263168 AA281079
422156 212379_1 N34524 AA305071 AW954803 AA502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643
AW835572 AW385512 AI334966 W32951 H62656 H53902 R88904 AW835732
422996 223666_1 BE091089 BE091123 AA319959
423833 232451_1 AW503329 N46610 AA331571
423841 232507_1 AW753967 AA370795 AA331630 AWg62550
423945 233566_1 AA410943 AW948953 AA334202 AA332882
425201 247933_1 AA352111 AW962247 AA429695
426650 270283_1 AA382814 AA402411 AA412355
426878 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
430264 315008_1 AA470519 BE303010 BE302954 BE384120
430785 323486_1 Z30201 AA486132 T72025
431676 336411_1 AI685464 AW971336 AA513587 AA525142
433687 373061_1 AA743991 AA604852 AW272737
434338 383982_1 AW754311 AA630185 AW803285
434469 387447_1 AA634806 C18732 AA729161 AA729860
435447 406400_1 AI872932 AA682306 BE220163 W88695 T81307 H91447
437152 43386_1 AU)50027 BE089051
437854 44418_1 AL119723 AL119874 A)909018 U50537
439031 46798_1 AF075079 H48601 H48795
439255 470321_1 BE164500 AA832198 BE164502
444910 624951_1 AI201849 BE069007 AW946544
445432 63943_1 AV653771 BE089370
446922 69865_1 BE175605 Z43529 F06610 BE175602 AV661027
447197 711623_1 R36075 AI366546 R36167
448420 76273_1 BE623004 AA380669 BE263627 BE246433
448516 766241_1 AW898595 AW898588 AW89859d AW898663 AW898592 AI525093
450522 837264_1 AI698839 AI909260 AI909259
450736 844652_1 AW970060 AI732366 AI792313 AW839644
451024 85565_1 AA442176 AA259181
451067 85759_1 BE172186 AA059279 AA020815 AA013437
451340 86640_1 AW936273 AW340350 AA017208
452542 921410_1 AW812256 AW812257 AI906423 AI906422
452564 92227_1 AA026777 N50065 R09961 N54721
453472 968371_1 AL037925 AL037931 AL037957
454307 1106070_1 AW855717 AW362452 AW362443
454359 1130674_1 N71277 AW390764
454545 1223779_1 AW806899 AW866451 AW8663g3 AW866297 AW817869
454693 1229132_1 AW813428 AW813444 AW813367 AW813368 AW813429 AW813424
454714 1230493_1 AW815098 BE154843 BE154831
455047 1250536_1 AW852530 AW852527 AW852526
455092 1252971_1 BE152428 AW855572 AW855607
455097 1253130_1 AW855802 AW855794 AW855797 AW855806 AW855796 AW855808 AW855793 AW855807
455100 1253334_1 BE160198 AW935898 T11520 AW935930 AW856073 AW861034
455431 1289854_1 AW938484 BE001245 BE001190
455511 1321229_1 BE144762 AW979091
455609 1337548_1 BE011668 BE0116a9 BE011627 BE011679 BE011699 BE011578 BE011696 BE011675 BE011622 BE011635
455651 1348732_1 BE064982 BE064979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804
BE064616 BE064850 BE064806 BE064796 BB)64818 BE064975 BE0648ig BE064810 BE064668 BE066059
455685 1350393_1 BE066976 BE066928 BE086g27
455700 1351264_1 BE068115 BE068104 BE068102 BE068096 BE068103 BE068154 BE068198
455708 1352232_1 BE069326 BE069290 BE069352
455732 1353874_1 BE080908 BE072258 BE072190 BE072236
455838 1374605_1 BE145808 BE145807 BE181883
455935 1384144_1 BE158687 BE158688
455945 1385588_1 BE160836 BE160606 BE160703
456207 165078_1 AA193450
455482 192289_1 AA485224 AA287308 AA258121
458094 47311_1 AF(J86325 W72956 W73221 AA219112
458673 679507_1 N99626 AI302701

TABLE 19B



Table 19B shows the genomic positioning for those Pkeys lacking Unigene IDs and
accession numbers in Table 19. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22", Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
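
An Nt_position entry lists one or more start-end ranges separated by commas. As a rough illustration of how such an entry could be turned into exon intervals, the sketch below parses the string and sums the exon lengths. The function names are hypothetical, and treating both coordinates as inclusive is an assumption the text does not state.

```python
def parse_exons(nt_position):
    """Turn '112844-112986,113505-113636' into a list of (start, end) tuples."""
    exons = []
    for part in nt_position.split(","):
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons

def total_exon_length(exons):
    """Sum exon lengths, counting both end coordinates (assumed inclusive)."""
    return sum(end - start + 1 for start, end in exons)

# Pkey 402184 as listed in Table 19B: two predicted exons on the minus strand.
exons_402184 = parse_exons("112844-112986,113505-113636")
length_402184 = total_exon_length(exons_402184)
```

Entries that are damaged in the printed table (fused digits or stray symbols) will raise a ValueError here, which is a reasonable signal that the entry needs manual checking against the cited Genbank record.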



Pkey Ref Strand Nt_position
400555 9801191 Minus 134694-134817
400608 9887666 Minus 96756-97558
400610 9887671 Minus 117606-117928,124040-124147
400925 7651921 Plus 38183^8391,43900-44066
401045 8117619 Plus 90044-90184,91111-91345
401049 7232177 Plus 149157-150692
401093 8516137 Minus 22335-23166
401256 9796573 Minus 45482-45620
401283 9800093 Minus 47256-47456
401326 9212516 Minus 226246-227505
401418 7452889 Minus 124865-125075
401451 6634068 Minus 119926-121272
401458 9187886 Plus 76485-77597
401497 7381770 Plus 92607-92813
401508 7534110 Minus 110779-110983
401575 7229804 Minus 76253-76364
401747 9789872 Minus 118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130488-130593,131M^^
131258,131866-131932,132451-132575,133580-134011
401781 7249190 Minus 8321M3435,83531-83656,83740-83901,84237^393,84955-85037,8629(M6814
401785 7249190 Minus 165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942
401793 7263868 Minus 102945-103083
401987 4406829 Minus 72893-73021,76938-77049
402077 8117414 Plus 65014-65195
402109 8131678 Minus 171722-171859,173197-173303
402184 8576001 Minus 112844-112986,113505-113636
402376 9625329 Minus 21753-22385
402421 9796341 Minus 46609-46652,46758-46811,86293^6346,89776-89829,90048-90101,102817-102924
402578 9884928 Plus 66350-66496
402606 9909429 Minus 81747-82094
402696 7328818 Minus 23600-23731
402765 9367757 Plus 109588-109726
402800 6010175 Plus 43921-44049,46181-46273
402820 6456853 Minus 82274-82443
402892 8086844 Minus 194384-194645
403133 7331427 Plus 38314-38634
403356 8569930 Plus 92839^3036
403388 9438331 Plus 112733-113001,114599-114735
403426 9719529 Minus 157156-158183
403585 8101208 Minus 131266-131769
403593 6862650 Minus 62554-62712,69449-69602
403637 8671936 Minus 142647-142771,145531-145762
403639 8671948 Plus 113234-113326,115186-115287,119649-119786
403677 7331517 Minus 55008-55083,62860-63051
403775 7770580 Minus 102247-102326,103095-103148
403943 7711864 Plus 100742-100904,101322-101503
404091 7684554 Minus 82121-83229
404097 7770701 Plus 55512-55781
404142 9856692 Minus 80316^59
404253 9367202 Minus 55675-56055
404274 9885189 Plus 104127-104318
404285 2326514 Plus 32282-32416
404360 9858450 Minus 122873-122966,151324-151469,153093-153253
404440 7528051 Plus 80430-81581
404443 7579073 Minus 87198-87441
404552 7243881 Plus 19854-20010
404561 9795980 Minus 69039-70100
404580 6539738 Minus 240588-241589
404721 9856648 Minus 173763-174294
404826 6572184 Plus 47726-48046
404983 4432779 Minus 51178^1374,52000-52173
405037 7543748 Minus 127374-127578
405041 7547195 Plus 121230-121714
405095 8072599 Plus 138877-139066
405153 9965565 Minus 175317-175500
405196 7230083 Minus 135715-135851
405232 7249042 Plus 125904-126063
405248 7259728 Plus 637-777
405336 6094535 Plus 33267-33563
405394 6624123 Minus 31900-32373
405460 7684569 Minus 52223-52389
405494 8050952 Minus 70284-70518
405547 1054740 Plus 124361-124520,124914-125050
405609 5757553 Minus 42814-43010,43583-43783^86345033,46429-46554,47815-48018,49961-50153,51624-51727,51823-5195^^
52918,55469-55601,57111-57307,58169-58296,60215-60332,61482-61727
405638 6289229 Plus 199260-199372,199826-199929
405654 4895155 Minus 53624-53759
405718 9795467 Plus 113080-113266
405822 6273498 Minus 154660-154974,155203-155379
405848 7651809 Minus 28135-28244
405873 6758747 Minus 32129-32764
405906 7706124 Minus 10835-11059
405917 7712162 Minus 106829-107213
405925 6758795 Plus 129935-130282
405953 7960374 Minus 65101-65574
406069 9117732 Plus 68880-69374
406151 7144806 Minus 94087-94285
406153 9929734 Minus 12902-13069
406182 5923650 Minus 28256-28935
406271 7534217 Plus 36179-36692
406291 5686274 Plus 9562-8867
406348 9255985 Minus 71754-71944
406414 9256407 Plus 49593-49850
406446 9454509 Minus 116424-116527,118721-118859,121187-121364
406504 7711360 Minus 107068-107277
406554 7711566 Plus 106956-107121

TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT TISSUES THAT ARE
LIKELY TO ENCODE EXTRACELLULAR OR CELL-
SURFACE PROTEINS






Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues
that are likely to encode extracellular or cell-surface proteins. These were selected as for
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult
tissue level was set to the 85th percentile value amongst 144 non-malignant tissues, the
96th percentile value amongst the 73 breast cancer specimens was greater than or equal to 100
units, and the predicted protein contained a structural domain that is indicative of
extracellular localization (e.g. ig, fn3, egf, 7tm domains, signal sequences, transmembrane
domains). The predicted protein domains are noted.
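
The domain criterion above can be pictured as a membership test on the comma-separated PredProtDomains field. This is only a sketch under assumptions: the function names and the marker set are hypothetical, and reading the table codes "SS" and "TM" as signal sequence and transmembrane domain is an interpretation that the text does not spell out.

```python
import re

# Hypothetical marker set built from the domains named in the paragraph above.
# Treating "SS" as signal sequence and "TM" as transmembrane is an assumption.
EXTRACELLULAR_MARKERS = {"ss", "tm", "ig", "fn3", "egf", "7tm"}

def split_domains(field):
    """Split a PredProtDomains field on commas, periods, or whitespace."""
    tokens = re.split(r"[,.\s]+", field.strip())
    return [t for t in tokens if t]

def is_extracellular(field):
    """True if any listed domain suggests extracellular or cell-surface location."""
    for token in split_domains(field):
        name = token.lower()
        if name in EXTRACELLULAR_MARKERS or name.startswith("7tm"):
            return True
    return False

flag = is_extracellular("SS,TM,7tm_1")
```

Splitting on periods as well as commas is a tolerance for scanned entries in which the separator was misread; a clean data source would only need the comma.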



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
PredProtDomains: Predicted Protein Domains
Unigene Title: Unigene gene title
R1: Ratio of 93rd percentile of tumor to 85th percentile of normal body tissue
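
The selection rule for Table 20 combines a percentile ratio, an expression floor, and a domain flag. The sketch below shows one way to compute it, assuming expression values are plain lists of numbers. The function names are hypothetical, the interpolation method is an assumption, and the percentiles are kept as parameters because the description cites a 96th percentile for the 100-unit floor while the R1 legend cites a 93rd percentile for the ratio.

```python
from statistics import quantiles

def percentile(values, pct):
    """Return the pct-th percentile (1-99) by linear interpolation."""
    cuts = quantiles(values, n=100, method="inclusive")
    return cuts[pct - 1]

def r1_ratio(tumor, normal, tumor_pct=93, normal_pct=85):
    """Ratio of a high tumor percentile to the 'average' normal tissue level."""
    return percentile(tumor, tumor_pct) / percentile(normal, normal_pct)

def passes_filter(tumor, normal, has_extracellular_domain,
                  min_ratio=3.0, min_units=100.0, level_pct=96):
    """Apply the three criteria: ratio, minimum tumor level, domain flag."""
    if not has_extracellular_domain:
        return False
    if percentile(normal, 85) <= 0:
        return False  # avoid dividing by a non-positive normal level
    if percentile(tumor, level_pct) < min_units:
        return False
    return r1_ratio(tumor, normal) >= min_ratio

# Illustrative, made-up expression values: 73 tumor and 144 normal samples.
tumor_values = [300.0] * 73
normal_values = [50.0] * 144
ratio = r1_ratio(tumor_values, normal_values)
selected = passes_filter(tumor_values, normal_values, True)
```

The sample values are invented for illustration only and do not correspond to any probeset in the table.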






Pkey ExAccn UnigeneID PredProtDomains UnigeneTitle R1
408591 AF015224 Hs.46452 SS,Uteroglobin,SS,Uteroglobin mammaglobin 1 168.6
400291 AA401369 Hs.190721 TM ESTs 72.2
449746 AI668594 Hs.176588 ,SS,P450 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 65.7
407277 AW170035 Hs.326736 TM Homo sapiens breast cancer antigen NY-BR 57.6
400292 AA250737 Hs.72472 death,ZU5,TM,Activin_recp,pkinase, BMP-R1B 55.5
424735 U31875 Hs.272499 ,SS,TM short-chain alcohol dehydrogenase family 53.8
426878 BE069341 TM gb:QV3-BT0381-270100-073^BT0381 Homo 50.3
428848 NM_000230 Hs.194236 SS,Leptin,SS,Leptin, leptin (murine obesity homolog) 40.8
407178 AA195651 Hs.104106 ,SS,Dihydroorotase, ESTs 39.3
408000 L11690 Hs.620 Plectin_repeat,SH3,spectrin,SS,Plectin_r bullous pemphigoid antigen 1 (230/240kD) 37.3
427585 D31152 Hs.179729 SS,C1q,Collagen,SS,C1q,Collagen, collagen, type X, alpha 1 (Schmid metaph 35.2
429441 AJ224172 Hs.204096 ,SS,Uteroglobin, lipophilin B (uteroglobin family member) 30.0
450375 AA009647 Hs.8850 ,SS,TM,disintegrin,Pep_M12B_propep,Repro a disintegrin and metalloproteinase doma 25.7
420931 AF044197 Hs.100431 SS,IL8,SS small inducible cytokine B subfamily (Cy 25.2
422109 S73265 Hs.1473 SS,Bombesin,SS gastrin-releasing peptide 24.8
445730 AI624342 Hs.170042 ,SS,TM,Cation_efflux ESTs 24.1
451110 AI955040 Hs.265398 SS ESTs, Weakly similar to transformation-r 24.0
400297 AI127076 Hs.334473 TM hypothetical protein DKFZp56401278 23.8
420813 X51501 Hs.99949 SS,SS prolactin-induced protein 22.8
452744 AI267652 Hs.30504 ,SS,TM,GNS1_SUR4,cNMP_binding,RIIa Homo sapiens mRNA; cDNA DKFZp434E082 (fr 22.6
424634 NM_003613 Hs.151407 ig,tsp_1,SSMA cartilage intermediate layer protein, nu 21.7
420757 X78592 Hs.99915 hormone_rec,Androgen_recep,zf-C4, androgen receptor (dihydrotestosterone r 21.7
424399 AI905587 Hs.2533 SS aldehyde dehydrogenase 9 family, member 20.3
447350 AI375572 Hs.172634 ,pkinase, ESTs 19.2
456207 AA193450 ,SS,TM,p450,p450 gb:zr40e07.n Soares_NhHMPu_Sl Homosapl 18.3
431448 AL137517 Hs.334473 TM hypothetical protein DKFZp56401278 18.2
427217 AA399272 Hs.144341 SS ESTs 18.2
456938 X52509 Hs.161640 ,SS,TM,aminotran_1_2,Cadherin_C_term,cad tyrosine aminotransferase 18.1
435496 AW840171 Hs.265398 SS ESTs, Weakly similar to transformation-r 17.9
402578 SS,p450,SS,TM,p450 C1001134:gi|2117372|pir||I65981 fatty ac 17.8
453160 AI263307 Hs.239884 SS H2B histone family, member L 17.8
422505 AL120862 Hs.124165 SS programmed cell death 9 (PDCD9) 17.7
444342 NM_014398 Hs.1G887 Lamp,SS,TM,Lamp, similar to lysosome-associated membrane 17.5

449765
428227 
425692 
424001 
448595 
449448 
418007 
418994 
453596 
452401 
446591 
419296 
452838 
422805 
448390 
447342 
411869 
443709 
459587 
442580 
400289 
411598 
415263 
433805 
407276 
443348 
421037 
424086 
400295 
452461 
427365 
433365 
409269 
432596 
408771 
432912 
447033 
421155 
424905 
425398 
438167 
459583 
423945 
439820 
402606 
445263 
430217 
447164 
431385 
423887 
415385 
425704 
429859 
425523 
428368 
418912 
422026 
451952 
438199 
400608 
413472 
432374 
402408 
445537 
451621 
405654 
434988 



N92293 

AA321649 

D90041 

W67883 

AB014544 

D60730 

M13509 

AA296520 



Hs.206832 

Hs.2248 

Hs.155956 

Hs.137476 

Hs.21572 

Hs.57471 

Hs.83169 

Hs.89546 



AA441838 Hs.62905 
NM_007l15Hs.29352 



H44186 

AA236115 

U65011 

AA436989 

AL035414 

A1199268 

W20027 

AI082692 

AA031956 

AI733682 

X07820 

BE336654 

AA948033 

AA706910 

AI951118 



Hs.15456 

Hs.120785 

Hs.30743 

Hs.121017 

Hs.21068 

Hs.19322 

Hs.23439 

Hs.134662 

Hs.130239 

Hs.2258 

HS70937 

Hs.130853 

Hs.112742 

Hs.326736 



AW873596 Hs.182278 
AI684808 Hs.197653 



AI351010 

W72838 

N78223 

AI873274 

AF026944 

AA576953 

AJ224741 



Hs.102267 

Hs.2533 

Hs.108106 

Hs.190721 

Hs.293797 

Hs.22972 

Hs.278461 



AW732573 Hs.47584 

BE007371 Hs.200313 

A1357412 Hs.157601 

H87879 Hs.102267 
NM_O02497Hs.153704 

AL049689 Hs.156369 

R28363 Hs.24286 
A1907673 
AA410943 
AL360204 

H57646 
N47863 
AF02e941 
BE178536 
AL080207 
R17798 
U79293 
NM_007050Hs.225952 

AB007946 Hs.158244 

BE440042 Hs.83326 
NM_000685Hs.89472 

U80736 Hs.1 10826 

AL120173 Hs.301663 

AW015531 Hs.122147 

BE242870 Hs.75379 

W68815 Hs.301885 
NA 

AJ245671 Hs.12844 

AI879148 HS56770 
NA 

A(41805S Hs.161160 



Hs.283853 

Hs.42586 

Hs.336901 

Hs.17518 

Hs.11090 

Hs.134585 

HsJ535 

Hs.159264 



SS.IL8. 

,SS.Ace^transf2» 

ipkinase, 

LRRCT,LRR,SS,LRRCT,serine_cart)pept 
SS 



ESTs, Moderately simitar to ALU8J1UMAN A 
small Inducit)te cytokine subfamily B (CX 
N-acelyltransferase 1 (arylamine N-acety 
paternally expressed 10 
KIAA0644 gene product 
ESTs 



SS»hemopexin,Peptldase_M10,SS,Peptldase^matrtx metalloprotelnase 1 (MMP1; inters 
SS,!ectin_c.siJshl,EGF,SS,EGF,lectin_c,su selectin E (endotherial adhesion molecul 



SS 

.SS.CUB^dink, 

PD2,SS 

SS 

SS.SS 

histone,SS,histone,histone 
SS 

.SS.Iipocatin 
,SS.PeptidaseJ/l1, 
,SS,TM.SNF 
,SS.LIM, 
SS 



hypothetical protein FU 14834 
tumor necrosis factor, alpha-induced pro 
PDZ domain containir\g 1 
ESTs 

preferentially expressed antigen in mela 
M2A hislone family, member A 
hypothetical protein 

Homo sapiens. Similar to RIKEN cDNA 2010 

ESTs 

ESTs 

gb:zk15e04.s1 Soares_pregnanLuterus_NbH 
ESTs 



hemopexln,Peptidase_M10.SS,PeptidaseJ/110matrix metaltoproteinase 10 (MMP10; str 



histone,SS,hi5tone,histone 
,SS,hlslone,histone,iinker_histDne 
,SS,RibosomaLL7Ae, 
TM 

.SS.DENN 
SS 

.SS,Lysyl_oxldase 
SS 

.SS>G9a.PHD. 
TM 

^S,TPR 

SSJM,UPF0016.SS.TM,UPF0016 

SS,EGF,vwa,SS,TM,vwa, 

TM,K^letra,ion_trans,SS,TM.K^telra.ionJ 

.SS.TM.Folate^cairier 

SS 



H3 histone farrdy, member A 
ESTs 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
calmodulin 2 (phosphorylase kinase, delt 
programmed cell death 9 (PDCD9) 
lysy! oxidase 

aldehyde dehydrogenase 9 tamily, member 

transcription factor 

ESTs 

ESTs 

hypothetical protein FLJ13352 
matrilin 3 

potassium voltage^ated channel, delayed 

ESTs 

ESTs 



SS,LysyLoxidase;\k)ose.epim,Epimerase,S lysyl oxidase 



pkjna5e,SS,TM.pldnase,poIyprenyLsynt, 

SS 

.SS.TM.71nLl.p450,nfn 

.pklnase, 

death,ZU5JM,Adivln_recp,pkInase, 
SS 
SS 

,SS,AcyltransfeFase, 



NIMA (never in mitosis gene a)-related k 
hypothetical protein similar to tenascin 
ESTs 

gb:iL-BT152-080399^04 BT152 Homo sapien 
gb:zt32h03.r1 Soares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cDN 
NM_024626:Homo sapiens hypothetical prot 
KIAA1560 protein 



3S,RNAJDoU^,R^^AJ?oLA2.RibosomaLS24e,ribosom^ protein S24 



JM.IBR 
,SSJM 

,SS,TM,BRCT,anKABCJran.ABC.tran 
.SS.Forl^Jead, 
SS 

.SS,TM,Y4)hosphatase,IVlAM.fn3. 
.SS,lam(nin_B,laminirLEGF,laminin_Ntenn 



Homo sapiens clg5 mRNA. partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBW-like protein 

Human clone 23948 mRNA sequence 
protein tyrosine phosphatase, receptor t 
KIAA0479 protein 



SS,Peptidase_M10,herTX)pexin,SS,Peptidase^matrix metalloproteinase 3 (stromelysin 



SS,TM,7tnLl.SS,TM,7tm_1, 

SS 

,SS,pkinase, 
,SS,ArfGap, 
SS.TM.SS,1M 
SS 
SS 

,SS,carb_anhydrase 
,SS,TM,ras 
SSJipocalinJIpocaltn. 
BTB,SS 
SS 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|7508633|p{r||T25392 hypolhe 
solute cam'er family 1 (glial high affi 
Homo sapiens cDNA FLJ11346fis, clone PL 
NM-.030920*:Homo sapiens hypothetical pro 
EGF-like-domain, multiple 6 (EGFL6) 
fatty acid binding protein 7, brain 
C12001521:gi|7513934|pir||T31081 cca3 pr 
ESTs 



17.3 

17.0 

16.7 

16.5 

16.3 

16.2 

15.7 

15.5 

15.5 

15.0 

14.9 

14.8 

147 

14.3 

14.2 

145 

13.9 

137 

13.7 

13.5 

13.5 

133 

13.2 

13.1 

13.1 

13.0 

12.9 

12.8 

12.7 

12.5- 

12.4 

123 

12.0 

11.9 

11.9 

11.9 

11.8 

11.8 

11.7 

11.6 

11.5 

11.5 

114 

11.4 

m 
iu 
11.1 
11.1 
11.1 

10.9 

10.9 

10.7 

10.4 

103 

10.3 

103 

103 

103 

10.2 

10.1 

10.0 

9.9 

9.8 

9.7 

9.6 

9.6 

9.6 
416220
431808 
414142 
400298 
418601 
415539 
421451 
429432 
442441 
426429 
406687 



N49776 Hs.170994 
M30703 Hs.270833 
AW368397 Hs.1 50042 
AA032279 Hs.61535 
AA279490 Hs.86368 



AI733881 

AA291377 

AI678059 

At820662 

X73114 

1^1126 



HS72472 

Hs.50831 

Hs.202676 

Hs.129598 

Hs.169849 

Hs.272620 



419948 
426214 
427718 
414812 
400285 
422330 
416182 
420077 
452281 
434531 
408380 
443788 
411078 
445495 
433426 
424871 
426215 
409045 
435525 
409203 
424902 
431725 
418092 
439840 
427811 
420807 
426320 
447078 
415786 
410102 
404347 
433687 
421373 
422634 
453310 
'435957 
407771 
443646 
446142 
444649 
435147 
439138 
429220 
428804 
453511 
439809 
414869 
416276 
452862 
452926 
453331 
420802 
450603 
422857 
418004 



AW004d54 Hs.228320 
AB041035 Hs.93847 



H59846 
Ai798660 
X72755 
NA 

D30783 



Hs.128355 

H5.25933 

Hs.77367 

Hs.1 15263 



NM_004354Hs.79069 
AW512260 Hs.87767 



T93500 

AA6420O7 

AF123050 

AI732643 

AI222020 

BE622641 

H6gi25 



Hs,28792 

Hs.1 16369 

Hs.44532 

Hs.144151 

H5.182364 

Hs.38489 

Hs.133525 



NM_004525Hs.153595 
AW963419 Hs.155223 
AA635062 Hs.50O94 
A1831297 Hs.123310 
AA780473 Hs.687 
NM_003866Hs. 153687 
X65724 Hs.2839 
R45154 Hs.106604 
AW449211 Hs.105445 
M81057 Hs.180884 
AA280627 Hs.57846 
W47595 Hs.169300 
AW885727 Hs.301570 
AW419196 Hs.257924 
AW248508 Hs.279727 

AA743991 

AA808229 Hs.1 67771 
NM_016010Hs.1 18821 
X70697 Hs.553 
N39015 Hs.190368 
AL138272 Hs.62713 
AI085198 Hs.164226 
AI754693 Hs.145968 
AW207523 Hs.197628 
AL133731 Hs.4774 
AI742605 Hs.193696 
AW207206 Hs.136319 
AK000713 Hs.193736 
AL031224 Hs.33102 
R41396 H8.101774 
AA157291 Hs.21479 
U41060 Hs.79136 
AW378065 Hs.8687 
AI742170 Hs.31297 
A1240665 Hs.8895 
U22376 Hs.1334 
R43646 Hs.12422 
L32137 Hs.1584 
U37519 Hs.87539 



,SS,TM 

SS,TM,EGF,SS 

.SS.UDPGT 

TM 

SS,TM,ca!retIculin,SSJM.calreticulin, 

death^U5JM,Activin_recp,pklnase» 

TM 

SS 

ss 

,SS,TM,lh3jg, 



hypothetical protein M6C10946 

amphiregulin (schwannoma-derived growth 

Homo sapiens cDNA FIJ14438 fts, done HE 

six transmembrane epithelial antigen of 

calmegin 

BMP-R1B 

ESTs 

synaptonemal complex protein 2 
ESTs 

myosfn-blnding protein C, slow-type 



SS,PeptldaseJA10,hemopexin,SS,Peptfdase_matrix metailoproteinase 11 (MMP11; stro 



SS 

Fem(Lreduct,TM.FerTk:_reduct 
SS 

.SS,TM,hIstDne,Sec1,hislone,sugarJr 
SS.IL8.SSjL8 

.TM^BC tranABC^mennbrane, 

SS,TM,EGF,SSJM 

cyclin,SS 

SS 

,SS.TGF-beta.TGFb_propeptide, 
SS 

,SS,TM,ubiquitIn,7tm_3ANF_receplor,sush 

m 
ss.ss 



hypothetical protein FLJ23537 
NM_016931'>tomo sapiens NADPH oxidase 4 ( 
ESTs, Moderately stmilar to ALU7.HUMAN A 
ESTs 

monokine induced by gamma interferon 

Eos Control 

epiregulin 

cyclin G2 

ESTs 

Homo sapiens cDNA FIJI 1041 fis, clone PL 

ESTs 

diubiquilin 

ESTs 

CoooaCrisp 



SS.SS,ENTH,LLWEQ.EMrHLLWEQ,DNAjTiisjeESTs. Weakly similar to 138022 hypothell 



,SS,TM 

SS,EGF,WLrecepLa.»di_receptb,SS.TM,E 
SS 

TM 

SS,p450,SS,p450 
SS.SS 

SS,CysJaK)t,SS 

,death,ZU5,pklnase^Act{vin_recp, 

SS 



ESTs 

low density lipoprotein-related protein 
stannk)calcin 2 

Homo sapiens mRNA; cDNA DKFZp434O0515 (f 
ESTs 

cytochrome P450, subfamily IVB, polypept 
inositol polyphosphate-4<pho6phatase, ty 
Norrie disease (pseudoglioma) 
ESTs 

GDNF family receptor alpha 1 



SS^n_carbOpept.PrDpepJ/i14,SS,Propep.M14carboxypeptidase B1 (tissue) 



SS,cpn10 
SS,TGF-belaJGFtLpropeptide,SS 
,SS,kazal, 
SS 
SS 
SS 
TM 

,SS,1MPDH^C.IMPDH_N.CBS 
SS 

TM,SNF.SS,TM.SNF. 
.SS,TM 

,TM,cpn60_TCP1,Sema. 
,TSPN»vwctsp J ,EGF,thlored, 
,TM,cadherin,Cadherin^C_temfi, 
,SS,rrm, 

JM,SDF,UPAR_LY6, 

TM 

SS 

»SS.UDPGT 

SS,SS 

SS 

SS 

SS.TMJM 

,SS.Pep_M12B,propep,Reprolyslh,tsp.1 , 
,SS,TM 



ESTs 

transforming growth factor, beta 2 
ESTs 

hypothetical protein FU13782 

Homo sapiens cDNA FLJ14035 fis, done HE 

TaigetExon 

gb3iy57g01.s1 NCLCGAPJ'rIS Homo sapiens 
ESTs 

CGI'62 protein 

solute earner family 6 (neurotransmitte 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cONA DKFZp761C1712 (f 

ESTs 

ESTs 

hypothe&al protein FU20706 
transcriptk}n tador AP-2 beta (activatl 
hypothetical protein FIJ23045 
ubinudein 1 

LiV-1 protein, estrogen regulated 
ESTs 

duodenal cytochrome b 



,SS,TM,dislntegr1n.Pep„M1 2B_propep,Repro ESTs 
SS.NA,myb_DNA-binding v-myb avian myeloblastosis viral oncogen 

SS ESTs 

SS,EGF,tspJ,SS,E2F_TDP, cartilage oligomeric matrix protein (COM 

SS.aldedh,SS,aldedh, aldehyde d^drogenase 3 family, member 




9.5 
9.5 
9.4 
9.4 
9.4 
9.4 
9.3 
9.3 
9.1 
9.1 
9.1 
9.1 
9.1 
9.0 
8.9 
8.8 
8.8 
8.8 
8.8 
8.7 
8,7 
8.6 
8.6 
8.6 
8.5 
8.5 
8.5 
8.4 
8.4 
8.4 
8.3 
8.3 
8.3 
8.3 
8.3 
8.2 
8.2 
8.2 
8.2 
8.1 
8.1 
8.0 
8.0 
8.0 
8.0 
7S 
7.9 
7.8 
7.8 
7.8 
77 
7.6 
7.6 
7,6 
7.6 
75 
7J5 
7.5 
7J5 
73 
7.4 
7.4 
7.3 
7.3 
7.2 
7.2 
7.2 
426451 AI908165 Hs.169946 SS,GATA, GATA-binding protein 3 (T-cell receptor 7.1
450701 H39960 Hs.2884a7 ,SS,LRR Homo sapiens cDNA FLJ12280 fis, clone MA 7.1
419519 AI198719 Hs.176376 SS ESTs 7.1
410555 U92649 Hs.64311 ,TM,disintegrin,Reprolysin a disintegrin and metalloproteinase doma 7.1
433138 AB029496 Hs.59729 SS,ig,Sema,SS,Sema,efhand semaphorin sem2 7.0
411558 AA102670 Hs.70725 SS,TM,SS,TM gamma-aminobutyric acid (GABA) A recepto 7.0
409079 W87707 Hs.82065 ,TM,fn3, interleukin 6 signal transducer (gp130, 7.0
417275 X63578 Hs.295449 SS,efhand,SS,efhand,ras parvalbumin 7.0
432731 R31178 Hs.287820 ,SS,fn3,fn1,fn2,fn2,fn1 fibronectin 1 6.9
442818 AK001741 Hs.8739 WD40,SS hypothetical protein FLJ10879 6.9
407366 AF026942 ,TM,IBR gb:Homo sapiens cig33 mRNA, partial sequ 6.8
427427 AF077345 Hs.177936 SS,lectin_c,SS ESTs 6.8
410785 AW803341 SS gb:IL2-UM0079^90300^50^ UM0079 Homo 6.7
401045 ICE_p20,SS,ICE_p10,ICE_p20,ICE_p10,ICE_p C11001883*:gi|6753278|ref|NP_033938.1| c 6.7
418986 AI123555 Hs.81796 ,SS,Reprolysin,tsp_1, ESTs 6.7
442082 R41823 Hs.7413 ,TM,EPH_lbd,pkinase,SAMM ESTs; calsyntenin-2 6.7
442861 AA243837 Hs.57787 SS ESTs 6.6
418836 AI655499 Hs.161712 ,TM,Activin_recp,pkinase,death,ZU5, ESTs 6.6
422060 R20893 Hs.325823 ,SS,TM,CD36 ESTs, Moderately similar to ALU5_HUMAN A 6.6
444381 BE387335 Hs.283713 ,SS,mito_carr ESTs, Weakly similar to S64054 hypotheti 6.6
404091 NA ,TM,7tm_3,ANF_receptor, Target Exon 6.6
417866 AW067903 Hs.82772 SS,Collagen,COLFI,TSPN,SS,TSPN collagen, type XI, alpha 1 6.6
428819 AL135623 Hs.193914 SS,SS KIAA0575 gene product 6.5
410275 U85658 Hs.61796 ,SS,Ribosomal_S4e transcription factor AP-2 gamma (activat 6.4
425236 AW067800 Hs.155223 SS stanniocalcin 2 6.2
415669 NM_005025 Hs.78589 ,SS,serpin, serine (or cysteine) proteinase inhibito 6.2
416319 AI815601 Hs.79197 SS,TM,ig,SS,TM CD83 antigen (activated B lymphocytes, 1 6.2
412140 AA219691 Hs.73625 ,SS,kinesin, RAB6 interacting, kinesin-like (rabkines 6.2
442942 AW167087 Hs.131562 ,SS,ig,Sema,pkinase, ESTs 6.2
446163 AA026880 Hs^5252 ,SS,TM,fn3, prolactin receptor 6.1
443162 T49951 Hs.9029 filament,SS,filament,filament DKFZP434G032 protein 6.1
409602 W25713 Hs.256972 ,SS,TM,DAGKa,DAGKc, ESTs 6.1
428479 Y00272 Hs.184572 ,SS,pkinase,pkinase cell division cycle 2, G1 to S and G2 to 6.1
400300 X03363 ,SS,TM,pkinase,Recep_L_domain,SH2,PH,Fur HER2 receptor tyrosine kinase (c-erb-b2, 6.1
433404 T32982 Hs.102720 SS ESTs 6.1
410079 U94362 Hs.58569 Glyco_transfJ,SS glycogenin 2 6.1
401781 ,SS,filament,Pribosyltran,filament,Armad Target Exon 6.1
447359 NM_012093 Hs.18268 SS,adenylatekinase, adenylate kinase 5 6.1
402230 NA ,SS,TM,p450, Target Exon 6.1
427674 NM_003528 Hs.2178 histone,SS,histone, H2B histone family, member Q 6.1
428398 AI249368 Hs.98558 ,SS,TM ESTs 6.0
458098 BE550224 Hs.74170 SS metallothionein 1E (functional) 6.0
419968 X04430 Hs.93913 SSJ1J5,L6, interleukin 6 (interferon, beta 2) 6.0
416636 N32536 Hs.42645 ,SS,TM solute carrier family 16 (monocarboxylic 6.0
419703 AI793257 Hs.128151 ,SS,zf-C2H2, ESTs 5.8
424687 J06070 Hs.151738 SS,Peptidase_M10,fn2,hemopexin,SS,TM,Pep matrix metalloproteinase 9 (gelatinase B 5.8
449679 AI823951 Hs.129700 SS tolloid-like 1 5.8
421296 NM_002666 Hs.103253 SS perilipin 5.8
442117 AW664964 Hs.128899 ,ss,nvi ESTs 5.7
400303 AA242758 Hs.79136 ,SS,TM LIV-1 protein, estrogen regulated 5.7
419440 AB020689 Hs.90419 SS KIAA0882 protein 5.7
444858 AI199738 Hs.208275 SS ESTs, Weakly similar to ALUA_HUMAN !!!! 5.7
432239 X81334 Hs.2936 SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 13 (collagenase 5.6
440705 AA904244 Hs.153205 TM ESTs 5.6
400266 NA SS,TM,ABC_tran,ABC_membrane,SS C16000922:gi|7499103|pir||T20903 hypothe 5.6
446466 H38026 Hs.308 arrestin,SS arrestin 3, retinal (X-arrestin) 5.5
423201 NM_000163 Hs.125180 SS,TM,fn3,SS growth hormone receptor 5.5
433043 W57554 Hs.125019 SS lymphoid nuclear protein (LAF-4) mRNA 5.5
439509 AF086332 Hs^14 ,SS,TM,Syntaxin ESTs 5.4
425247 NM_005940 Hs.155324 SS,Peptidase_M10,hemopexin,SS matrix metalloproteinase 11 (MMP11; stro 5.4
409757 NM_001898 Hs.123114 ,SS,cystatin, cystatin SN 5.4
425292 NM_005824 Hs.155545 SS 37 kDa leucine-rich repeat (LRR) protein 5.4
448045 AJ297436 Hs.20166 ,SS,TM prostate stem cell antigen 5.4
452681 AF153330 Hs.30246 ,SS,TM solute carrier family 19 (thiamine trans 5.3
452243 AL355715 Hs.28555 SS programmed cell death 9 5.3
439310 AF086120 Hs.102793 ,SS,TM,UDPGT,casein_kappa ESTs 5.2

441111
452355 
427711 
418636 
429353 
441690 
430447 
429698 
425325 

423600 
414737 

403593 
407758 
445234 
411165 
420633 
414117 
416783 
401093 
411096 
457411 
436007 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 



A1806867 
N54926 
M31659 
AW749855 
AL1 17406 
R81733 
W17064 
A1665086 
X52730 



Hs,126594 
Hs.29202 
Hs. 180408 



445413 
424420 
432378 
452190 
434674 
419986 
421582 
410361 
426327 
406639 
452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418867 
443514 
447499 



Hs^00102 
Hs.33106 
Hs.332848 
Hs.26339 
Hs.1892 

phenylethanolamfne N-methyltransferase 
A1633559 Hs.310359 SS 
AI150386 Hs.125087 
NA 

D50915 Hs.38365 
AW137636 Hs.146059 
NM_000169Hs.69089 
NM»014581Hs.274480 
W88559 Hs.1787 
AA206186 HS798B9 



U80034 Hs.68583 

AW085951 Hs.130093 

A1247716 Hs.232168 
NM-004460HS.418 

AA641836 Hs.30085 

AI215069 Hs.89113 

AF012023 Hs.173274 

M73700 Hs.105938 
NA 

AW057735 Hs.323910 

AF070526 Hs.13429 

M93221 Hs.75182 

AA526235 Hs.193162 

BE093589 Hs.38178 

AA447453 Hs.27860 

AW016669 Hs.29190 

A(668605 Hs.60380 

AA587376 Hs.269533 

AA33944g Hs.82285 

AI860021 Hs.270651 

AA151342 Hs.12677 

BE614743 H8.146688 

AI493046 Hs.146133 



ESTs 

G prolein-cajpled receptor 34 
solute carrier family 25 (mitochondrial 
gb:QV4-aT0534-281299^53-o05 BT0534 Homo 
ATP-bindIng cassette transporter MRP8 
ESTs 

SWl/SNF related, matrix associated, acti 
ESTs, Weakly similar to S21348 probable 



,SS,TM,Phospliodiest, 
TM,7tm_1.TM 
SS 

,SS.TM.HECT 

,SS,TM,ABC Iran^ABC membrane, 
.SS,HECT,zf-UBR1,PABP,14^-3, 
SS 

,SS.ras. 

SS.NNIVn-.PNIVIT.TEMr.SSJ^NMT_PNMTjm 

5.1 
ESTs 

SS ESTs 
,CIDE-N,pklnase Target Exon 

SS.SS K1AA0125 gene product 

,SS,T1VI ESTs 
SS.Mellbtase,BTK.PH,pk}nase,SH2,SH3.Ribo galactosklase, alpha 
SS,lq9ocalin,SSJipocaltn odorarit-binding protein 2A 

,TM,ion_trans,K_tetra, proteollpid protein 1 (Pelizaeus-Merzbac 

SS.TM.TM monocyte to macrophage differenfetion-a 

TN/ILRRCTJIVI.LRRCT, C12000586*:gil6330157|dbj|BAA86477.1| (A 

Peptidase_l^3» mitochondrial intennediate peptidase 

SS ESTs 
,SS,adtuzinc, ESTs 
SS.DPPIV_NJermPeptid^e_S9,SSJ)PPIV_N_fibrDblast activation protein, alpha 



,SS,trypsln 
SS 

,SS.14-« 

SS,transferTin,7tnLl .transferrin. 
SS,TM,Folate_carrier,SS,TM,Folate_carrie 



hypothetical protein FU23186 
ESTs 

integrin cytoplasmic domain-associated p 
lactetransferrin 

NM_025243*:Homo sapiens solute earner f 



,SS,TM,pkinase,Recep_L_domaln,SH2.PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 



,SS,Ca_channeLB, 

SS,TM,lectin_c,fri2,Ricin_BJeclin,SS,TIW 
SS 
SS 

.SSJM,7tm_1. 
,SS,TM,CBS,voltage^CLC 
.TM,Glyco_hydro_1 
,SS»pkinase.RhoGEF.(g,PH,SH3, 



Homo sapiens clone 24787 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cDNA FU1 1983 fis. done HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA DKFZp586M0723 if 
ESTs 

ESTs. litoderately similar to AIJU6_HUMAN A 
ESTs 



AIRS.fonnyLtransf,GARS.SS,GARS,AIRS,for phosphoribosytglycinamide fomnyltransfer 



H26735 

AA831879 

At345455 

A1910275 

BE391804 

W03242 

M97711 

A1638627 



Hs.91668 

Hs.136985 

HSJ8915 

Hs.1406 

Hs.62661 

Hs.44898 

Hs.105685 



AA179949 Hs,175553 
AA863360 Hs.26040 
AA976718 Hs.202242 
AA3120&2 Hs.105445 
AW294092 Hs.21594 
BE466639 Hs.61779 
AK000136 Hs.10760 
H87648 Hs.33922 
NM_000246Hs,3076 
AI472078 Hs.303662 
AW935490 Hs.14658 
D45371 Hs.80485 
AW296927 
D31771 Hs.89404 
BE464288 Hs.141937 
AW262580 Hs.147674 



.pkinase 
SS.UPF0099,SS,UPF0099, 
,SSJM.MAPEG, 
,SS,TM,UDPGT 

,SSJM,PH,SH2,Furin-like,pldnase,Recep^L 

,SS,HlsLdeacetyl, 

pkinase,OPR, 

SS,trefoa,SS,TM,ldLrecepLa,SRCR,tiyps 

SS.TM,GBPJM.GBP 

SS 

SS.SS.b. 

,SS,DEAD,ForIehead 
SS 

,SS,TM.p450, 

.ig.Sema, 

SS 

,SS,ras,Yj)hosphatase.ras 

,SS,HI^_box,filament, 

SS,LRR,SS 

SS 

SS,LRR, 

,SSArfGap, 

,SS,B1R 

SS,C1q,Collagen,SS,C1q, 
,SS,TM,PeptidaseJ/I1, 
SS,homeobox,homeobox. 
,SS.TM,MIP, 

.SSJM,cadhei1n,cadhenn 




ESTs, Moderately similar to A47582 B-cet 
CGH47 protein 
prostaglandin E synthase 
ESTs 

Homo sapiens ctone PP1498 unknown mRNA 
ESTs 

GA-blndIng protein transcription factor, 

trefoil factor l(pS2) 

guanylate binding protein 1, Interferon- 

Homo sapiens clone TCCXJTA00151 mRNAsequ 

gb:Human T-cell receptor (V beta 18.1, J 

K1AA1688 protein 

Homo sapiens mRNA; cDNA DKFZp564N0763 (f 
ESTs. Weakly similar to fatty add omega 
ESTs 

GDNF family receptor alpha 1 
hypolheticat protein MGC15754 
Homo sapiens cONA FLJ13591 fis, clone PL 
asporin(LRR class 1) 

Homo sapiens, clone MGC:9084, mRI^A, comp 

MHC dass U transactivator 

ESTs 

Human chromosome 5q13.1 clone 5G8 mRNA 
adipose most abundant gene transcript 1 
gb:Ul-H-BW0-ajc^>07-0-ULsl NCLCGAP.Su 
msh (Drosophlia) homeo box homolog 2 
ESTs 

protocadheffn beta 16 



5.2 
5.2 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 



5.1 
5.1 
5.1 
5.0 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
4.8 
4.8 
4.6 
4.7 
4.7 
4.7 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3
4.3 
4.3 
441560
409064 
422667 
454032 
432663 
401747 
432682 
437036 
447754 
443194 
451871 
457211 
421566 
431657 
427899 
444779 
442295 
436396 
446039 
422938 
405922 
439285 
424800 
429922 
447178 
409038 
452747 
420139 
408877 
403329 
439926 
430832 
432481 
452410 
418661 
431958 
425071 
447197 
428722 
428330 
423242 
449048 
414831 
413589 
408875 
418629 
450787 
414870 
450325 
407633 
426172 
442262 
427961 
445563 
403943 
408761 
423279 
420440 
445107 
428303 
411667 
427809 
418203 
'430376 
444190 
433495 
429638 



F13386 Hs.7888 
AA062954 Hs.141883 
H25542 Hs.1 33471 
W31790 Hs.194293 
A1984317 Hs.122589 

NM_013257Hs279696 
AI571514 Hs.133022 
AW073310 Hs.163533 
Ai954968 Hs.279009 
AI821005 Hs.118599 
AW972565 Hs.32399 
NM.000399Hs,1395 
AI345227 Hs.105448 
AA829286 Hs.332053 
AI192105 Hs.147170 
AI827248 Hs^24398 
AI683487 Hs.152213 
AI150491 Hs.90756 
NI\A_001809Hs.1594 
S70284 

AL133916 Hs,172572 
AL035588 Hs.153203 
Z97530 Hs.226117 
AW594641 Hs.192417 
T97490 Hs.50002 
BE153855 H&61460 
NM_005357Hs.95351 
AA479033 Hs.130315 
NA 

AW014875 Hs.137007 
AI073913 Hs.100686 
AW451645 Hs.151504 
AL133619 Hs.29383 
Nf\L001949Hs.1189 
X63629 Hs.2877 
NM„013989Hs.1 54424 
R36075 

U76456 Hs.1 90787 
L22524 Hs.2256 
ALO39402 Ks.125783 
Z45051 Hs.22920 
M3115e H&77439 
AW452831 Hs.313803 
Nl\L0l5434Hs.48604 
BE247550 Hs.86859 
AB006190 Hs.25475 
N72264 Hs.300670 
AI935962 Hs.26289 
NM_007069Hs.37189 
AA371307 Hs.125056 
BE170651 Hs.8700 
AW293165 Hs.143134 
AW873606 Hs.149006 

AA057264 Hs.238938 
AW959861 Hs.290943 
NM.002407HS.97644 
AI208121 Hs.147313 

AW974476 Hs.1 83601 
BE160198 

M26380 Hs.180878 
X54942 Hs.83758 
AW292053 Hs.12532 
AIB7891B Hs.10526 
AW373784 Hs.7l • 
AI916662 Hs.211577 



.pkinase, 

.SS.CUB, 

.SS.mFMOIIke 

.SSJM 

TM 

,SS»filament,fIlament 

pkinase.pkinase^C, 

,SSJM 



.SS.TM 
,SS.GDNF. 
WH1,WH1 
zf-C2H2,SS 
,SS,TM,pkinase 



,SS,SAAjDrDtelnsABCjnerribrane,ABC.tran,serum amyloid A1 



Homo sapiens clone 23736 mRNA sequence 

ESTs 
ESTs 

ESTs, Weakly similar to 154374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
serumfgluGocorticoid regulated kinase-li 
ESTs 

Homo sapiens cDMA FIJ14142 lis. clone MA 

matrix Gla protein 

ESTs 

ESTs. Weakly similar to 851797 vasodDat 
early growth response 2 (Krox-20 (Drosop 
ESTs, Weakly similar to B34087 hypotheU 



SS 

,COLFI,vwcCollagen, 
.wnt, 

JM,Glyco_hydroJ 
,SSJM,lhiolase, 
SS,TM»Desaturase.SS 
,SS,ig,pkinase,UlRNT.I-RRCT, 
HLH.SS 

.SS,TM,llnker.histone,7tni.1 
,SS.TM 
SS.IL8.SS.IL8 

,SS.TM,p450. 
.SS,TM 

SS,SS 
SS 
SS 

.SS.Collagen.COLFl.TSPN. 
,SS,TM,ras 
SS 



ESTs 



Homo sapiens cDNA FIJ11469 fls. clone HE 
wlngless-type MMTV integratton site fami 
ESTs 

centromere protein A (17kD) 
gb:stearoyl-CQA desaturase [human, adipo 
hypothetical protein FLJ20093 
MyoD femily inhibitor 
HI histone family, member 0 
ESTs 

small inducible cytokine subfamily A (Cy 
Ig superfamSy receptor LMIR 
lipase, homnone-sensttive 
ESTs. Weakly simUar to A47582 B^ieli gr 
Target Exon 
ESTs 

ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FU11973 Hs, done HE 
Homo sapiens mRNA; cDNA DKFZp434E2321 (f 
E2F transcriptkin factor 3 
SS,TIv1,Cadherfn_CJerm,cadherin,SS,TM,cadcadherin 3. type 1, P-cadherin (placenta 
SS,T4_dek)dlnase,T4_dek)dinase, detodinase, iodothyronine, type II 

,TM,SDF, gb:yh88b01.s1 Scares placenta Nb2HP Homo 

,SS,TIMP, tissue inhibitor of metalloprotelnase 4 

SSPeplldase M10,SS,Peptidase IWIChenropematrix metalloprotelnase 7 (1VIMP7; uterin 
SS protein 
SS,SS.TM similar to S68401 (cattle) glucose indue 

,SS.cNMP_blndlng,Rlla.HMG_box protein kinase, cAIWP-dependenI, reguteto 

,SS.abhydrolase ESTs, Highly similar to AF157833 1 noncl 

SS DKFZP434B168 protein 

SS,SH2,PH,SS JM.PH,SH2,Furin-like,pkInas growth factor receptor-bound protein 7 



SS.TM.MIP,SS,TM,MIP, 
SS 
SS 

TMJM 
,SS,DENN 
.SS,START, 
SS 

.SS,WH1,WH1 
p450,SS.p450 
.SS.TM.7lnLl. 
SS 

,SS,SRCR,Uterogkibin 
,SSJIVI 

SS,RGS,RGS,RGS 
TM 

»SS,lipase,PUT.Sec7PH. 

CKS,SS,CKS, 

SS 

SS 

SS.Ig.MHCJ.connexln,SCAN,SS,TI\4 
SS.TM,SS 



aquaporin 7 
K1AA1204 protein 
ESTs 

similar to rat HREV107 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C5000355:gi|4503225|ref|NP_000765.1 1 cyt 
ESTs, Weakly similar to (defHne not ava 
ESTs 

mammsgbbin 2 (MGB2; mammagtobin B; lip 
ESTs. Weakly similar to 138022 hypotheO 
regulator of G-protein signalling 16 
gb:QV1>HT0413-010200^-h03 HT0413 Homo 
lipoprotein lipase 
COC28 protein kinase 2 
chromoson^e 1 open reading frame 21 
cysteine and glycine-rich protein 2 
alpha-2-glycoprotein1,zinc 
kinedln 1 (Wnesln receptor) 



4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2
4.1
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
3.9 
3.9 
3.9 
3.9 
3.9 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.6 
3.8 
3.8 
3.8 
3.8 
3.8
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 






454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407999 
425546 
446619 
411213 
406625 
417511 
428769 
407137 
401865 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 
459496 
414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 
406925 
445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



AI041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW1 94426 Hs^0726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM.014735Hs£2292 
At126271 H&49433 
AA890023 H&1906 
AU076643 Hs.313 
AA676939 Hs.69285 
Y13647 Hs.119597 
AL049176 HS32223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NNL000909HS.1 69266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM-014400HS.11950 
AI791493 HS.129B73 
AI308876 Hs.103849 
AF245505 Hs.72157 
AI417828 Hs.192435 
AA847843 Hs.62711 
S57296 Hs^23910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NM„002543Hs.77729 
AA1 16021 Hs.38260 
Ai651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM_014918Hs.110488 
AK001423 Hs.94694 
U25128 Hs.159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM_00231BHs.83354 
BE390551 Hs.77628 
NIVL003512HS.28777 
NM_000346Hs.2316 
AA442324 Hs.795 
M25809 Hs.64173 

NM_004585Hs.17466 
AA319233 Hs^l 
C18356 Hsi95944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NM.005419HS.72988 
AW968504 Hs.l23073 

BE274552 Hs.76578 
AW797437 Hs.69771 
H73505 Hs.117874 
D86961 Hs.79299 
NH.006456HS.286215 



TMJtmJ, 

,SS,TM,E6F.ldLrecepLa,ldLrecepLb,EGF 
,SS,AAA, 

,SS,Glyc»s„transL2, 
SS 

Glycos^lransf_2,RfciruB_leclin.SS,Glycos 

lipocalin.SS.ripocallnjipocaltn^ferrifi 

,SS,Pribosyltran.Sulfatase 

PHD,pldnase.SS 

SS 

SS.TM.fn3,SS.TM,fh3, 
,SS.TM.efhandjof\_trans 



SS.mCUB.F5_F8^type»C,MAM,SS,TM.CUB,F5jieurop!IIn 1 



ESTs 

ESTs. Weakly slmdar to ALU1.HUMAN ALU S 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-alpha-D-5atectosamlne:polyp 

fatty acid binding protein 4, adipocyte 

DNA replication factor 

KIAA0215gene product 

ESTs. Wealdy similar to YZ28„HUMAN HYPGT 

prolactin receptor 

secfeted ptiosphoprotetn 1 (osteopontin, 



stearoyi-CoA desaturase (delta-Q-desatur 

chordin-like 
ESTs 

gb:ye53h05.s1 Scares fetai fiver spleen 
Target Exon 

noesenciiyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs. Weakly similar to S64329 probable 
ESTs 

small inducible cytokine subfiamfly A (Cy 
GPI-anchored metastasis^ociated prote 
ESTs, Weakly similar to A36036 cytochrom 
,TM.hemopexin,Peptidase_M10.hemopexln,Pehypothetical protein DKFZp761D112 
ig.LKRCT. DKFZP564I1922 protein 

,SSJM ESTs 

,SS,HMGJ)OX, Homo sapiens, clone IMAGE:3351 295, mRNA 

,SSJI^,SH2.PH,pklnase,Recep„Ujdcmain,Fuiv-erb4)2 avian erytiiroblastic leukemia v 



SS,TM,Desaturase,SS 

SS 

,SS,7trTUl,SPRY, 

.SS.TM.GDA1„CD39 

.SS.filannent. 

SS,homeobox,Ets,SS,homeobox, 
SS 

SS.TM.7tmJ, 
SS 

.SS,TM.PMP22_Claudln, 
.SS.TM.IL8 
.SSPHJactamase^ 
.SS.p450.p450 



HER2 receptor tyrosine kinase (c-erb-b2, 
serum-indudble kinase 
hypotfietical protein FLJ20093 
EST 

oxidised km density lipoprotein (lectin 
ubiquitin specifte protease 18 
ESTs 
cyclinBI 

ESTs, Weakly similar to ALUS JiUMAN ALU S 
KIAA0990 protein 

Homo sapiens cDNA FLJ10561 lis, done NT 
parathyroid hormone receptor 2 



SS 

pklnase,POLO_box.SS,pkinase,POLO_box, 
.SS,ig,pkinase.LRRNT.LRRCT, 
.SSJM,KRAB.SCAN,zf<:2H2.Ig 
.SS.TM 

SS,UCH-1,UCH-2,SS,TM,6jkLtranspept 
SS 

cydin.SS,TMiCyclln, 
SS 
SS 
SS 

SS,TM.7tmJ,SS,TM.7tnL2 

JM,ABQ_membrane,ABC_tran,RibosomaLS4eCbp/p300-interacting transactivator, v/W 
,SS,TM,transporU)rot.SWIB,RhoGAP,DAG_PEglycerol^-phosphate dehydrogenase 1 (so 
,SS.nm,PABP,pklnase,1 4-3-3,mn poly(A)-bInding protein, cytoplasmic 1-1 

.SS.TM.mito^can-.LysyLoxidase lysyl oxidase-like 2 

SS,START,SS,START.NNMT_PNI^jrEMT, steroidogenic acute regulatory protein r 
SS.histone.Catc_CGRPJAPP,lg.MHCJ,SPRY,H2A histone family, member L 



SS.HMG_box, 
histone,SS,histone,BoiA 
ATP-synLab,SS,7tm^l ,ATP-synLab 
filamentSS.fiiamenLfilanfjent 
TM 

,SS,TM,RibosomaLL27e, 

.KunH?_BPTI, 

SS.p450,SS 

SS 

SS,IL8, 

SH2,STAT,SS,STAT 
.pkinase, 

RhoGEF,PH,SS,RhoGEF,PH, 
SAP,SS.FG^AP.vwa 
SS,sushl,tiypsin.Wtfa,rrm,fibrinogen C,fn 
,SS.PeptWase.S8,P,PeptWase^S8,P 
,SS.TM 

»SSPribosyttran, 
SRY (sex determining region Y)-box 9 (ca
H2A histone family, member 0 
ATPase, H transporting, lysosomal (vacuo 
NH-005557*iHomo sapiens keratin 16 (foca 
retinoto add receptor responder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450. subfamily )IB (phenobar 
ESTs, Weakly similar to S65824 reverse t 
small indudble cytokine subfamily B (Cy 
signal transducer and activator of trans 
CDC2'related protein kinase? 
NM_00337r;Homo sapiens vav 2 oncogene ( 
protein Inhibitor of activated STAT3 
B-fador, properdin 
ESTs 

lipoma HMGiC fusion partner-ilke 2 
sialyltransferase 



3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.5 
3.5 
3.5 
3.5 
3.5 

3.5

3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.3 
3.3 
3.3 
3.3 
445462
439452 
452017 






452106 
447519 
426928 
438825 
414575 
417837 
422128 
445941 
429973 
444542 
459561 
425741 
426501 
456508 
434228 
415752 
400419 
439750 
423858 
428514 
428698
446988 
432072 
417433 
452194 
444051 
4200^ 
457292 
421458 
431104 
443767 
419589 
415447 
443464 
423431 
413278 
458451 
440449 
413753 
434876 
435575 
415773 
446440 
450847 
426075 
452110 
439963 
402837 
439451 
406664 
417315 
413011 
414987 
429197 
446030 
407604 
419092 
456672 
443171 
452256 
432201 
406642 
400903 






AA378776 Hs.288649 

AA918317 Hs.57987 

AF109302 Hs.27495 

AK000725 Hs.50579 

AI141031 Hs.21342 

U46258 Hs.339665 

AF037062 Hs.172914 

BE327427 Hs.79953 

H11257 HS.2K68 

AL079905 Hs.1103 
AW881145 
A1267371 
AI423317 
AI161293 
Ai547306 
AF052152 



Hs.172636 
HS.1646B0 
Hs.280380 
Hs.134g81 
Hs.159412 
AWD43782 Hs.293616 
AA502764 Hs.123469 



Z42047 

BE314524 

AF084545 

AL359053 

AL137326 



Hs.283976 
Hs.78776 

Hs,57664 . 
Hs.133483 
AW236861 Hs,193139 
AA852773 Hs.334838 
Y09763 Hs.22785 
N62937 Hs.269109 
BE270266 Hs.82128 
AI694413 Hs.332649 
N48373 Hs.10247 
AW015140 Hs.161723 
AI921270 Hs.334862 
NI\/L003654Hs.1O4576 
AW970859 Hs.313503 
BE562136 Hs.9736 
AW973708 Hs.201925 
297171 Hs.78454 
BE548446 Hs.5167 
AA326062 
BE563085 Hs.833 
AW297181 Hs.195922 
AA865430 Hs.201925 
U17760 Hs.75517 
AF160477 Hs.61460 
AF213457 Hs.44234 
R21651 H5324725 
AV658411 Hs.42656 
NWL003155HS.25590 
AW513691 Hs.270149 
T47667 Hs.28005 
AW247529 Hs,5793 
NA 

AF086270 Hs.27B554 
L34041 Hs.9739 
A1080042 Hs.336901 
AWD68115 Hs.821 
AA524394 Hs.294022 
H24471 Hs.26930 
N30714 Hs.325960 
AW191962 Hs^4g23g 
J05581 Hs.89603 
AK002016 Hs.114727 
BE281128 Hs.9030 
AKD00933 Hs^8661 
AI538613 Hs.298241 
AJ245210 
NA 



SS.SS 

SS.SS 

SS 

SS 

SS 

SS 

,SS,adh_short,TGF-beta,TGFbj3ropeptide 
,SS,TM,histoneANF_receptDr,guanytate„cy 
,SS,pktnase.tg, 

SSJGFbj>ropeptide.TGF-beta,SS 
SS 

SS.SSjipoxygenase.PLAT 
,SSJ4x))C,UDPGT 

SS,SSPeptldase_M1.EGF,lg,lecfiruc,susht 
SS 

,pkinase» 

SS 

SS 

,SS,TM.7tnu1 
TM 

,SSPeptidase„M1, 

,TM,lntegrin_B»Ricin3_teclin/nn 

.SS,TM 

.SS,START,NNMT_PNMT_TEMT. 

SS 

SSJM.TM 
,Sema,ig, 

SS.TM,LRRCT.LRRNT,LKR,TM,LRRCT. 

,SS,TM,7tnL3,ANF_receptDr^ushl 

,SS,lg. 

,SS,CUB, 

SSJM.SS»TM,G-patch 
SS 

,Sema,^, 

,SS,PCI,RasGEF,hormonejec,zf-C4, 

.FGF, 

SS,OLF.OLF.OLF,RibosomaLU 

SSJM.SSF.SS,TM 

,SS,p450,p450 

,SS.TM,ubiquitin,laminlrL.G,lamlnln_EGF.k 
,SS,RlbosomaLL14 
.FGF. 



hypothetical protein MGC3077 

B-cell CLL/lymphoma 1 1 B (zinc finger pro 

prostate cancer associated protein 7 

hypothetical protein FU20718 

ESTs 

ESTs 

retinol dehydrogenase 5 (11-ds and 9^ 
ESTs 

Homo sapiens clone IMAGE:451939. mRNA se 
transforming growth factor, beta 1 
gb:QV0-OT0033^1 0400-182-307 OT0033 Homo 
ESTs 
ESTs 

aminopeptidase 
ESTs 

Homo sapiens clone 24628 mRNA sequence 
ESTs 

ESTs, Weaidy similar to AF208855 1 BNW1 
Homo sapiens PR02751 mRNA, complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length Insert cDN 
Homo sapiens mRNA; cl3NA DKFZp434B0650 (f 
ESTs 

KIAA1866 protein 

gamma-aminobutyrtc acid (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblast glycoprotein 
oifectory receptor, family 2. subfamily 
acGvated leucocyte cell adhesion molecu 
ESTs 

hypothetical protein RJ14251 
carbohydrate (Iceratan sulfate Gal-6) sul 
ESTs 

proteasonf)e (prosome, macropaln) 26S subu 
Homo sapiens cDNA FIJ13446 fis, done PL 
myodlin, trabecular meshwork inducible 
Homo sapiens mRNA; cONA DKFZp434F152 (fr 
gb:EST29171 Cerebellum II Homo sapiens c 
interferon-stimulated protein, 15 kDa 
ESTs 

Homo sapiens cDNA FU13446 (is, done PL 



SS,iamlnin,EGF,lam1niii|iterm,adhjshort.S laminin. beta 3 (nicein (125kD), kafinin 



.SS,HLH 
SS.ig.SS,TM 

,SS,TI\A,RibosomaLS3Ae,G-gannma 
SS 

.SS.homeobox, 
.SS,fn3. 

,SSJU Activin_recp,pkinase 

.TIW,p450,Ets 

SS 

,SS,Chromo_shadow,chronx), 



ig superfamily receptor LNIR 
triggering receptor expressed on myek>kl 
ESTs, IVIoderately similar to A47582 B-cel 
KIAA1681 protein 
stannkicaldn 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FLJ11309 fls. clone PL 
platelet-activating factor acetylhydrola 
ENSP00000241312*:DJ947L8.1.8 (novel Sush 
heterochronr^atin-like protein 1 



SS,TM,transportj)rot.SWIB.RhoGAP,DAG_PEglyceral-3-phosphate dehydrogenase 1 (so 
SS,RNAj)oLA.RNAj»IJ\2,RibosomaLS24e.riboson^l protein S24 



SS,LRR.LRRNT.SS,LRRNT.LRR. 
,SS,connexin,hormone rec;zf-C4,oonnexin 
,SS,Gelsolin, 
.SS,TIi/l 
,SS,TM,C1q, 
SS,TM.SEA, 
,S$,PKPK 
SS,TM,7tnLl,rmn,SS 
,TIW,GDI,7tnLl. 

SS.TM.trypsin,SS,TM.treft)li,trypsln,tref 

SS 

SS 




biglycan 
hypothetical protein FLJ14950 
ESTs. Weakly slmRar to T20272 hypotheQ 
membrane-spanning 4-donelns. subfamily A 
collagen, type VIII, alpha 2 
mudn 1. transmembrane 
Homo sapiens, done MGC:16327. mRNA, com 
TONDU 

Homo sapiens cDNA FU10071 fis, clone HE 
Transmembrane protease, serine 3 
gbiHomo sapiens mRNA for immunogk)buIln 
Target Exon 



3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.3
3.2 
3.2 
3.2
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2
3.2 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
,SS,Glyco_hydro_2


ESTs 


1 


43^if4 AWybzoHf Hs.oloOo 


SS.WD40.SS,WD40. 


KIAA1547 protein 


1 1 
J.1 




fn3.SS,TM,EF1BD 


class 1 cytokine receptor 




^O/tOCji A\AioQQT5c Lining ion 


SS 


ESTs, Weakly similar to G01763 atrophin- 


0.1 


A^MQC AIOOC/XO Ll« CQOOQ 


UBACT^repeat.SS,UBACT_repeatThlF„famityhypothetical protein FU10808 


0.1 


4iyio0 T29d1o Hs.oswO 


,TM,pkinase,fn3, 


TEK tyrosine kinase, endothelial (venous 


3.1 




.SS,SRCR, 


Homo sapiens, Similar to RIKEN cDNA 5830 


0.1 


410624 AI7340oO nS.1 04211 


.Semajg, 
,SS,abhydrolase, 


ESTs 


3.1 


436291 BE568452 Hs.5101 


protein regulator of cytokinesis 1 


3.1 


411000  N40449     Hs.201619  SS                                        ESTs, Weakly similar to S38383 SEB4B pro  3.1
412519  AA19o241   Hs.73980   SS,Troponin,Hemagglutinin,SS,TM,C2,Tropo  troponin T1, skeletal, slow               3.1
450223  AA418204   Hs.241493  ,SS,pro_isomerase,                        natural killer-tumor recognition sequenc  3.1
422790  AA609o75   Hs.25933   ,TM,histone,Sec1,histone,sugar_tr         ESTs                                      3.1
424269  AW137691   Hs.199754  ,SS,TM,7tm_2,GPS                          ESTs                                      3.1
435854  AJ278120   Hs.4996    ,SS,WD40                                  putative ankyrin-repeat containing prote  3.1
447388  AW630534   Hs.76277   ,SS,TM,tTn),oxidored_q6,oxidored_q6       Homo sapiens, clone MGC:9381, mRNA, comp  3.1
451631  R00866                SS                                        gb:ye79c02.s1 Soares fetal liver spleen   3.0
448105  AW591433   Hs.298241  ,SS,TM,trefoil,trypsonMoit                Transmembrane protease, serine 3          3.0
438637  BE500941   Hs.125730  ,TM,PH,                                   ESTs, Weakly similar to KIAA1214 protein  3.0
423024  AA593731   Hs.325823  ,SS,TM,CD36                               ESTs, Moderately similar to ALU5_HUMAN A  3.0
456592  R91600                ,SS,Ran_BP1,LIM,Ran_BP1,GRIP,TPR,LIM      gb:yq10c02.r1 Soares fetal liver spleen   3.0
425920  AL049977   Hs.162209  SS,TM,SS,TM,PMP22_Claudin,PMP22_Claudin   claudin 8                                 3.0
444670  n5o373     Hs.33293o  ,SS,TM                                    hypothetical protein MGC5370              3.0
401197                        arf,arf,                                  ENSP00000229263*:HSPC213.                 3.0
437755  AW2042S6   Hs.291887  ,wnt                                      ESTs                                      3.0
452560  BE0770o4   Hs.33D432  ,SS,rnn^-RanBP,pkinase,C2,pkinase_C,DA    ESTs                                      3.0
410274  AA3o1o07   Hs.Dl762   SS,SS                                     hypoxia-inducible protein 2               3.0
450098  W27249     Hs.8109    SS                                        hypothetical protein FLJ21080             3.0
404825                        ,SS,TM                                    Target Exon                               3.0
458389  H70284     Hs.160152  ,SS,RA                                    ESTs, Weakly similar to FPHU alpha-fetop  3.0
408196  AL034548   Hs.43627   HMG_box,pkinase,zf-CCHC^,TM,HMG_box,      SRY (sex determining region Y)-box 22     3.0
433675  AW977653   Hs.75319   ,SS,ribonuc_red_sm,                       ribonucleotide reductase M2 polypeptide   3.0
418848  AI820951   Hs.193465  ,death,ZU5,pkinase,Activin_recp,          ESTs                                      3.0
422095  AI868872   Hs.282804  SS,Cu-oxidase,SS,Cu-oxidase,Cu-oxidase    hypothetical protein FLJ22704             3.0
415992  C05837     Hs.145807  ,SS,TM                                    hypothetical protein FLJ13593             3.0
424631  AA688021   Hs.179808  SS                                        ESTs                                      3.0
409956  AW103364   Hs.727     SS,TGF-beta,TGFb_propeptide,SS,TGF-beta,  inhibin, beta A (activin A, activin AB a  3.0
419667  AU077005   Hs.92208   SS,disintegrin,Reprolysin,Pep_M12B_prope  a disintegrin and metalloproteinase doma  3.0
450946  AA374569   Hs.127698  SS                                        ESTs, Moderately similar to 2109260A B c  3.0
447770  AB032417   Hs.19545   Frizzled,Fz,SS,TM,Frizzled,Fz,            frizzled (Drosophila) homolog 4           3.0
423826  U20325     Hs.1707    SS,SS                                     cocaine- and amphetamine-regulated trans  3.0
418838  AW385224   Hs.35198   ,SS,TM,Phosphodiest,                      ectonucleotide pyrophosphatase/phosphodi  3.0
442804  AW300118   Hs.131257  ,SS,TM,G-gamma                            ESTs                                      3.0
432284  AA532807   Hs.10S822  ,SS,TM,pkinase,                           ESTs                                      3.0
TABLE 20A

Table 20A shows the accession numbers for those pkeys lacking Unigene IDs for Table 20.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey    CAT number  Accessions

410785  1221055_1   AW803341 AW803255 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
411667  1253334_1   BE160198 AW935898 T11520 AW935930 AW856073 AW861034
418636  177402_1    AW749855 AA225995 AW750208 AW750206
420854  197072_1    AW296927 AI664514 AJ263168 AA281079
422128  211994_1    AW881145 AA490718 M85637 AA304575 T06067 AA331991
423431  228162_1    AA326062 AA325758 AW962182
423945  233566_1    AA410943 AW948953 AA334202 AA332882
426878  273265_1    BE069341 AW748403 AL044891 AI908240 AA393080
433687  373061_1    AA743991 AA604852 AW272737
447197  711623_1    R36075 AI366546 R36167
451631  878098_1    R00865 R01523 AI806815
456207  165078_-1   AA193450
456592  202684_1    R91600 T87079 AA291455
TABLE 20B



Table 20B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 20. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref      Strand  Nt_position

400608  9887666  Minus   96756-97558
400903  2911732  Plus    59112-59228
401045  8117619  Plus    90044-90184,91111-91345
401093  8516137  Minus   22335-23156
401197  9719705  Plus    176341-176452
401747  9789672  Minus   118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131^ 131932,132451-132575,133580-134011
401780  7249190  Minus   28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573
401781  7249190  Minus   83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814
401866  8018106  Plus    73125-73623
402230  9966312  Minus   29782-29932
402408  9796239  Minus   110326-110491
402578  9884928  Plus    66350^96
402606  9909429  Minus   81747-82094
402837  9369121  Minus   2013-2186,9570-9758,11136-11309,19429-19677,21210-21455,23368-23562,24342-24527,29132-29320
403199  9958183  Minus   58895-59036,66618-66789
403329  8516120  Plus    96450-96598
403593  6862650  Minus   62554-62712,69449-69602
403943  7711864  Plus    100742-100904,101322-101503
404091  7684554  Minus   82121-83229
404347  9838195  Plus    74493-74829
404826  6572184  Plus    47726-48046
405366  2182280  Plus    22478-22632
405654  4895155  Minus   53624-53759
405654 4895155 Minus 53624-53759 
TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT TISSUES THAT ARE
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS
AMENABLE TO MODULATION BY SMALL MOLECULES

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues
that are likely to encode either enzymes or proteins amenable to modulation by small
molecules. These were selected as for Table 19, except that the ratio was greater than or equal
to 3.0, the "average" normal adult tissue level was set to the 85th percentile value amongst
144 non-malignant tissues, the 93rd percentile value amongst 73 breast cancers was
greater than or equal to 80 units, and the predicted protein contained a structural domain that is
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase,
peptidase, phosphatase, ATPase, or ion_transporter domains). The predicted protein domains
are noted.
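
The selection rule described for Table 21 can be sketched in Python. This is a minimal illustration, not the procedure actually used: the nearest-rank percentile method, the function names, the keyword list, and the substring matching of domain names are all assumptions, and the tumor percentile is taken as the 93rd, following the definition of R1 given with the table.

```python
import math

# Domain keywords named in the text; matching by substring is an assumption.
DRUGGABLE_KEYWORDS = ("pkinase", "peptidase", "phosphatase", "atpase", "ion_trans")


def percentile(values, pct):
    """Nearest-rank percentile (assumed method; the source does not specify one)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def r1_ratio(tumor, normal, tumor_pct=93, normal_pct=85):
    """Ratio of the tumor percentile to the normal-tissue percentile.
    Assumes the normal-tissue level is positive."""
    return percentile(tumor, tumor_pct) / percentile(normal, normal_pct)


def passes_filter(tumor, normal, domains, min_ratio=3.0, min_units=80.0):
    """Apply the three criteria: ratio, absolute tumor level, domain type."""
    has_domain = any(
        key in domain.lower() for domain in domains for key in DRUGGABLE_KEYWORDS
    )
    return (
        r1_ratio(tumor, normal) >= min_ratio
        and percentile(tumor, 93) >= min_units
        and has_domain
    )


# Hypothetical expression values: 73 tumors, 144 non-malignant tissues.
tumor = [10.0] * 50 + [400.0] * 23
normal = [20.0] * 144
print(r1_ratio(tumor, normal))
print(passes_filter(tumor, normal, ["SS", "TM", "pkinase"]))
print(passes_filter(tumor, normal, ["SS", "TM"]))
```

Using a high percentile of the tumor values rather than the mean lets a gene qualify when only a subset of tumors over-express it, which appears to be the intent of the criteria.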
Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of 93rd percentile tumor to 85th percentile of normal body tissue



Pkey ExAccn UnigeneID Predicted Protein Domains



449746 
400292 
424735 
407178 
408045 
450375 
429170 
445730 
424834 
420757 
424399 
447350 
456207 
456938 
402578 
425692 
424001 
418007 
421727 
411869 
400289 
443348 
424086 
400295 
408771 
421155 
424905 
438167 
459583 
423945 
445283 



AI668594 Hs.1 76588 
AA250737 Hs.72472 
U31875 Hs.272499 
AA195651 Hs.104106 
AW138959 Hs.245123 
AA009647 Hs.8850 
NM.001394Hsi359 
AI624342 Hs.170042 
NIVL003613HS.151407 
X78592 Hs.99915 
A1905587 Hs.2533 
A1375572 Hs.l72634 
AA193450 

X52509 Hs.161640 

D90041 Hs.155956 
W67883 Hs.137476 
IW13509 Hs.83169 
Y13153 Hs.107318 
W20027 lHs.23439 
)(07820 Hs.2258 
AW873596 H5.182278 
AI351010 Hs.102267 
W72838 Hs.2533 
AW732573 Hs.47584 
H87879 Hs.102267 
NM„002497Hs.1 53704 
R28363 Hs.24286 
AI907673 
AA410943 

H57646 Hs.42586 



SS.p450 

deatii ZU5,TM,Activin_recp,pklnase, 

SS.TM 

SS.Dihydroorotase, 
PhospIiodiestSomatomedin_B, 

SS.TM.dislntegrtn.Pep_M1 2B_propep,Reprol a disintegrin and metailoproteinase doma 



Unigene Title

ESTs, Weakly similar to CP4Y_HUMAN CYTOC
BMP-R1B

short-chain alcohol dehydrogenase family
ESTs 
ESTs 



DSPc,Rhodanese,
SS,TM,Cation_efflux
b.tsp_1,SS,AAA

hormone_rec,Androgen_recep,zf-C4,
SS

pkinase,

SS,TM,p450,p450



dual specificity phosphatase 4
ESTs

cartilage intermediate layer protein, nu
androgen receptor (dihydrotestosterone r
aldehyde dehydrogenase 9 family, member
ESTs

gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi



SS,TM,aminotran_1_2,Cadherin_C_term,cadh tyrosine aminotransferase
SS,p450,SS,TM,p450 C1001134:gi|2117372|pir||I65981 fatty ac

SS,Acetyltransf2, N-acetyltransferase 1 (arylamine N-acety

pkinase, paternally expressed 10

SS,hemopexin,Peptidase_M10,SS,Peptidase_ matrix metalloproteinase 1 (MMP1; inters

kynurenine 3-monooxygenase (kynurenine 3
SS,Peptidase_M1, ESTs

hemopexin,Peptidase_M10,SS,Peptidase_M10 matrix metalloproteinase 10 (MMP10; str
SS,DENN calmodulin 2 (phosphorylase kinase, delt

SS,Lysyl_oxidase lysyl oxidase

SS aldehyde dehydrogenase 9 family, member

TM,K_tetra,ion_trans,SS,TM,K_tetra,ion_t potassium voltage-gated channel, delayed
SS,Lysyl_oxidase,Aldose_epim,Epimerase,S lysyl oxidase
pkinase,SS,TM,pkinase,polyprenyl_synt, NIMA (never in mitosis gene a)-related k



SS,TM,7tm_1,p450,rrm
pkinase,

death,ZU5,TM,Activin_recp,pkinase,
SS,Acyltransferase,



ESTs

gb:IL-BT152-080399-004 BT152 Homo sapien
gb:zt32h03.r1 Soares ovary tumor NbHOT H
KIAA1560 protein



R1 

65.7 
55.9 
53.8 
39.3 
34.9 
25.7 
24.9 
24.1 
21.7 
21.7 
20.3 
19.2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15.3 
13.9 
13.5 
13.0 
12.8 
12.7 
11.9 
11.8 
11.7 
11.5 
11.5 
11.4 
11.2 



423887 
429859 
428368 
418912 
451952 
402408 
415539 
406687 
419948 
400285 
408380 
409203 
424902 
431725 
418092 
427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443895 
423545 
401045 

442082 
418836 
404091 
450865 
424085 
449550 
432304 
415669 
442942 
428795 
428479 
400300 
447359 
402230 
424687 
432328 
432239 
400286 
425247 
434737 
439310 
441111 
452355 
427711 
429353 
425325 

448706 
403593 
432777 
446232 
411165 
414117 
411096 
450506 
417975 
447752 
427122 



AL080207 Hs. 134585 
NM_007050Hs.225952 
BE440042 Hs.83326 
NM>000685Hs.89472 
A1120173 Hs.301663 
MA 

AI733881 Hs.72472 
M31126 Hs.272620 
AB041035 Hs.93847 
NA 

AF123050 Hs.44532 
AA7e0473 Hs.687 
NIVL003866HS.153687 
X65724 Hs.2839 
R45154 Hs.106604 
M81057 Hs.180884 
M90516 Hs.1574 
AW242243 Hs.168670 
NM001034Ha75319 
NIVL004482HS.278611 
D38299 Hs.170917 
AI240655 Hs.8895 



SS,TM,BRCT,ank,ABC_tran,ABC_tran DKFZP434G232 protein
SS,TM,Y_phosphatase,MAM,fn3, protein tyrosine phosphatase, receptor t

SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 3 (stromelysin
SS,TM,7tm_1,SS,TM,7tm_1, angiotensin receptor 1

SS,pkinase, ESTs

SS,carb_anhydrase NM_030920*:Homo sapiens hypothetical pro

death,ZU5,TM,Activin_recp,pkinase, BMP-R1B
SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 11 (MMP11; stro



Ferric_reduct,TM,Ferric_reduct,
TM,ABC_tran,ABC_membrane,
SS,TM,ubiquitin,7tm_3,ANF_receptor,sushi
SS,p450,SS,p450
SS,SS

SS,Cys_knot,SS
death,ZU5,pkinase,Activin_recp,



NM_016931:Homo sapiens NADPH oxidase 4 (
Eos Control
diubiquitin

cytochrome P450, subfamily IVB, polypept
inositol polyphosphate-4-phosphatase, ty
Norrie disease (pseudoglioma)
ESTs 



Y12735 
U37519 
U92649 



Hs.38018 
Hs.87539 
Hs.64311 



AW204099 Hs.337720 
AP000692 Hs.129781 



C11001883*:gi|6753278|ref|NP_033938.1| c



SS,Zn_carbOpept,Propep_M14,SS,Propep_M14 carboxypeptidase B1 (tissue)
GATase_2,SIS, glutamine-fructose-6-phosphate transamin

SS,TM,WD40,ubiquitin,E1-E2_ATPase,Cation peroxisomal farnesylated protein
SS ribonucleotide reductase M2 polypeptide

SS,TM,Glycos_transf_2,Ricin_B_lectin,DPP UDP-N-acetyl-alpha-D-galactosamine:polyp
SS,TM,7tm_1, prostaglandin E receptor 3 (subtype EP3)

SS,TM,disintegrin,Pep_M12B_propep,Reprol ESTs

pkinase, dual-specificity tyrosine-(Y)-phosphoryl

SS,aldedh,SS,aldedh, aldehyde dehydrogenase 3 family, member

TM,disintegrin,Reprolysin, a disintegrin and metalloproteinase doma

ESTs, Weakly similar to AF126760 1 reUn
GAF,PDEase chromosome 21 open reading frame 5

ICE_p20,SS,ICE_p10,ICE_p20,ICE_p10,ICE_p



R41823 Hs.7413 
AI655499 Hs.161712 
NA 

AI248013 Hs.106532 
NIVL002914HS.139226 
AF055575 Hs.23838 
AA932186 Hs.69297 
NM005025Hs78589 
AW167087 Hs.131562 
R45503 Hs.97469 
Y00272 Hs.184572 
X03363 

NML012093HS.16268 
NA 

J05070 Hs.151738 
AI572739 Hs.195471 
X81334 Hs.2936 
NA 

NM^005940Hs.1 55324 
AA828246 Hs.291884 
AF086120 Hs.102793 
AI806867 Hs.126594 
N54926 Hs.29202 
IV131659 Hs.180408 
AL117406 Hs.200102 
X52730 Hs,1892 
phenylethanolamine N-methyltransferase
AW291095 Hs.21814 SS,TM,pkinase,
NA

AA564991 Hs,269477 
AI281848 Hs.194691 
NIVlJ00169Hs.69089 
W88559 Hs.1787 
U80034 Hs.68583 
NM_00446QHs.418 
AA641836 Hs.30085 
W3700 HS.10593B 
AW057736 Hs.323910 



TM,EPH_lbd,pkinase,SAM,fn3,
TM,Activin_recp,pkinase,death,ZU5,
TM,7tm_3,ANF_receptor,
zf-C2H2

SS,AAA,Viral_helicase1,rrm,
TM,ion_trans,SS,TM,ion_trans,
TM,7tm_1,
SS,serpin,

SS,ig,Sema,pkinase,
SS,TM

SS,pkinase,pkinase



6.7 

ESTs; calsyntenin-2
ESTs

Target Exon

ESTs, Weakly similar to I38588 reverse t
replication factor C (activator 1) 2 (40
calcium channel, voltage-dependent, L ty
ESTs

serine (or cysteine) proteinase inhibito
ESTs

ESTs, Highly similar to A39769 N-acetyll
cell division cycle 2, G1 to S and G2 to



SS,TM,pkinase,Recep_L_domain,SH2,PH,Furi HER2 receptor tyrosine kinase (c-erb-b2,
SS,adenylatekinase, adenylate kinase 5

SS,TM,p450, Target Exon

SS,Peptidase_M10,fn2,hemopexin,SS,TM,Pep matrix metalloproteinase 9 (gelatinase B
6PF2K,PGAM, 6-phosphofructo-2-kinase/fructose-2,6-bi
SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 13 (collagenase
SS,TM,ABC_tran,ABC_membrane,SS C16000922:gi|7499103|pir||T20903 hypothe
SS,Peptidase_M10,hemopexin,SS matrix metalloproteinase 11 (MMP11; stro

UCH-1,pkinase,OPR,Rhodanese,AMP-binding, ESTs
SS,TM,UDPGT,casein_kappa ESTs
SS,TM,Phosphodiest, ESTs
TM,7tm_1,TM G protein-coupled receptor 34

SS solute carrier family 25 (mitochondrial

SS,TM,ABC_tran,ABC_membrane, ATP-binding cassette transporter MRP8

SS,NNMT_PNMT_TEMT,SS,NNMT_PNMT_TEMT,STAR

5.1

interleukin 20 receptor, alpha
CtDE'N,pkinase Target Exon

alpha-amylase, ESTs
SS,TM,7tm_1,Ribosomal_L13 retinoic acid induced 3

SS,Melibiase,BTK,PH,pkinase,SH2,SH3,Ribo galactosidase, alpha
TM,ion_trans,K_tetra, proteolipid protein 1 (Pelizaeus-Merzbac

Peptidase_M3, mitochondrial intermediate peptidase

SS,DPPIV_N_term,Peptidase_S9,SS,DPPIV_N_ fibroblast activation protein, alpha
SS,trypsin hypothetical protein FLJ23186

SS,transferrin,7tm_1,transferrin, lactotransferrin
SS,TM,pkinase,Recep_L_domain,SH2,PH,Furi HER2 receptor tyrosine kinase (c-erb-b2,
10.9

10.4 

10^ 

10.3 

10.3 

9.8 

9.4 

9.1 

9.1 

8.8

8.6 

8.3

8.3 

8.3 

8.3 

8.2 

8.1 

7.8 

7.6 

7.6 

7.5 

7.3 

7.2 

7.2 

7.1 

6.9 

6.6 



6.7
6.6 
6.6 
6.5 
6.5 
6.4 
6.2 
6.2 
6.2
6.1 
6.1 
6.1 
6.1 
6.1 
5.8 
5.8 
5.6 
5.6 
5.4 
5.4 
5.2 
5.2 
5.2 
5.2 
5.1 



5.1 
5.1 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
400181 NA SS,TM,3Beta_HSD, ENSP00000171555:CDNA FLJ10727 fis, clone 4.6
452093 AA447453 Hs.27860 SS,TM,7tm_1, Homo sapiens mRNA; cDNA DKFZp586M0723 (f 4.6
435542 AA687376 Hs.269533 SS,pkinase,RhoGEF.^,PH,SH3, ESTs 4.6
417576 AA339449 Hs.82285 AIRS,formyl_transf,GARS,SS,GARS,AIRS,for phosphoribosylglycinamide form 4.6
446089 AI860021 Hs.270651 pkinase ESTs, Moderately similar to A47582Bk»I 4.6
424420 BE614743 Hs.146688 SS,TM,MAPEG, prostaglandin E synthase 4.5
452190 H26735 Hs.91668 SS,TM,PH,SH2,Furin-like,pkinase,Recep_L_ Homo sapiens clone PP1498 unknown 4.5
419986 AI345455 Hs.78915 pkinase,OPR, GA-binding protein transcription factor, 4.5
421582 AI910275 Hs.1406 SS,trefoil,SS,TM,ldl_recept_a,SRCR,tryps trefoil factor 1 (pS2) 4.5
446733 AA863360 Hs.26040 SS,TM,p450, ESTs, Weakly similar to fatty acid omega 4.4
453060 AW294092 Hs.21594 SS,ras,Y_phosphatase,ras hypothetical protein MGC15754 4.4
400205 NA NM_006265*:Homo sapiens RAD21 (S. pombe) 4.4
420854 AW296927 SS,TM,Peptidase_M1, gb:UI-H-BW0-aic^07-O-UI.s1 NCI_CGAP_Su 4.3
432690 AF181490 Hs.278527 SS,pyr_redox,SS,Ribosomal_L39 prenylcysteine lyase 4.3
441560 F13386 Hs.7888 pkinase, Homo sapiens clone 23736 mRNA sequence 4.3
416445 AL043004 Hs.79337 SS,pkinase, KIAA0135 protein 4.3
439024 R96696 Hs.35598 SS,TM,trypsin,vwd,^ ESTs 4.3
432862 NM_013257 Hs.279696 pkinase,pkinase_C, serum/glucocorticoid regulated kinase-li 4.2
447754 AW073310 Hs.163533 pkinase, Homo sapiens cDNA FLJ14142 fis, clone MA 4.2
453775 NM_002916 Hs.35120 SS,AAA,PI3_PI4_kinase,PI3Ka,PI3K_rbd,PI3 replication factor C (activator 1) 4 (37 4.2
431657 AI345227 Hs.105448 SS,TM,pkinase ESTs, Weakly similar to B34087 hypotheti 4.1
427899 AA829286 Hs.332053 SS,SAA_proteins,ABC_membrane,ABC_tran, serum amyloid A1 4.1
422938 NM_001809 Hs.1594 SS,TM,thiolase, centromere protein A (17kD) 4.1
418478 U38945 Hs.1174 ank,ank cyclin-dependent kinase inhibitor 2A (me 4.1
406922 S70284 SS,TM,Desaturase,SS gb:stearoyl-CoA desaturase [human, adipo 4.1
439285 AL133916 Hs.172572 SS,ig,pkinase,LRRNT,LRRCT, hypothetical protein FLJ20093 4.1
429922 Z97630 Hs.226117 SS,TM,linker_histone,7tm_1 H1 histone family, member 0 4.1
420139 NM_005357 Hs.95351 SS,TM,p450, lipase, hormone-sensitive 4.0
425071 NM_013989 Hs.154424 SS,T4_deiodinase,T4_deiodinase, deiodinase, iodothyronine, type II 4.0
424511 BE300512 Hs.193557 SS,Y_phosphatase,Band_41 ESTs, Moderately similar to ALU7_HUMAN A 4.0
428722 U76456 Hs.190787 SS,TIMP, tissue inhibitor of metalloproteinase 4 3.9
428330 L22524 Hs.2256 SS,Peptidase_M10,SS,Peptidase_M10,hemope matrix metalloproteinase 7 (MMP7; uterin 3.9
414831 M31158 Hs.77439 SS,cNMP_binding,RIIa,HMG_box protein kinase, cAMP-dependent, regulato 3.9
413589 AW452631 Hs.313803 SS,abhydrolase ESTs, Highly similar to AF157833 1 nond 3.8
418629 BE247550 Hs.86859 SS,SH2,PH,SS,TM,PH,SH2,Furin-like,pkinas growth factor receptor-bound protein 7 3.8
413453 AA129640 Hs.128065 SS,Peptidase_C1,gpdh ESTs 3.8
403943 p450,SS,p450 C5000355:gi|4503225|ref|NP_000765.1|cyt 3.8
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATIO 3.8
408761 AA057264 Hs.238936 SS,TM,7tm_1, ESTs, Weakly similar to (defline not ava 3.8
427809 M26380 Hs.180878 SS,lipase,PLAT,Sec7,PH, lipoprotein lipase 3.7
418203 X54942 Hs.83758 CKS,SS,CKS, CDC28 protein kinase 2 3.7
454071 AI041793 Hs.42502 TM,7tm_1, ESTs 3.7
424676 Y08565 Hs.151678 Glycos_transf_2,Ricin_B_lectin,SS,Glycos UDP-N-acetyl-alpha-D-galactosamine:polyp 3.7
457465 AW301344 Hs.122908 SS,Pribosyltran,Sulfatase DNA replication factor 3.7
417601 NM_014735 Hs.82292 PHD,pkinase,SS KIAA0215 gene product 3.7
446619 AU076643 Hs.313 SS,TM,efhand,ion_trans secreted phosphoprotein 1 (osteopontin, 3.7
406625 Y13647 Hs.119597 SS,TM,Desaturase,SS stearoyl-CoA desaturase (delta-9-desatur 3.6
428769 AW207175 Hs.106771 SS,7tm_1,SPRY, ESTs 3.6
426310 NM_000909 Hs.169265 SS,TM,7tm_1, neuropeptide Y receptor Y1 3.6
417531 NM_003157 Hs.1087 SS,pkinase,vwa,vwa,Glyco_transf_8 serine/threonine kinase 2 3.6
444781 NM_014400 Hs.11950 SS,PH,lactamase_B GPI-anchored metastasis-associated prote 3.6
431493 AI791493 Hs.129873 SS,p450,p450 ESTs, Weakly similar to A36036 cytochrom 3.6
428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 3.6
414175 AI308876 Hs.103849 TM,hemopexin,Peptidase_M10,hemopexin,Pep hypothetical protein DKFZp761D112 3.6
455325 AW895719 TM,ion_trans,K_tetra, gb:QV4-NN0039-290300-154-«)6 NN0039 Homo 3.6
429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 3.6
425320 U29344 Hs.83190 Acyl_transf,adh_zinc,ketoacyl-synt,pp-bi fatty acid synthase 3.5
431854 AA383550 Hs.271699 IMS,SS polymerase (DNA directed) iota 3.5
407104 S57296 Hs.323910 SS,TM,SH2,PH,pkinase,Recep_L_domain,Furi v-erb-b2 avian erythroblastic leukemia v 3.5
449051 AW961400 Hs.333526 SS HER2 receptor tyrosine kinase (c-erb-b2, 3.5
434398 AA121098 Hs.3838 pkinase,POLO_box,SS,pkinase,POLO_box, serum-inducible kinase 3.5
454042 H22570 Hs.172572 SS,ig,pkinase,LRRNT,LRRCT, hypothetical protein FLJ20093 3.5
407756 AA116021 Hs.38260 SS,UCH-1,UCH-2,SS,TM,G_glu_transpept ubiquitin specific protease 18 3.5
401464 AF039241 Hs.9028 Peptidase_M24, histone deacetylase 5 3.5
412970 AB026436 Hs.177534 DSPc,Myosin_tail, dual specificity phosphatase 10 3.5
412049 N53437 Hs.18268 SS,adenylatekinase, adenylate kinase 5 3.5
425776 U25128 Hs.159499 SS,TM,7tm_2,SS,TM,7tm_2 parathyroid hormone receptor 2 3.4
407846 AA426202 Hs.40403 TM,ABC_membrane,ABC_tran,Ribosomal_S4e Cbp/p300-interacting transactivator 3.4
406925 L34041 Hs.9739 SS,TM,transport_prot,SWIB,RhoGAP,DAG_PE- glycerol-3-phosphate dehydrogenase 3.4
445873 AA250970 Hs.251946 SS,rrm,PABP,pkinase,14-3-3,rrm poly(A)-binding protein, cytoplasmic 1-l 3.4
418054 NM_002318 Hs.83354 SS,TM,mito_carr,Lysyl_oxidase lysyl oxidase-like 2 3.4
406815 AA833930 Hs.288036 SS,IPPT, tRNA isopentenylpyrophosphate transferas 3.4
410530 M25809 Hs.64173 ATP-synt_ab,SS,7tm_1,ATP-synt_ab ATPase, H transporting, lysosomal (vacuo 3.4
407021 U52077 gb:Human mariner1 transposase gene, comp 3.4
421168 AF182277 Hs.330780 SS,p450,SS cytochrome P450, subfamily IIB (phenobar 3.4
431473 AA825686 Hs.321176 SS ESTs, Weakly similar to S65824 reverse t 3.4
408101 AW968504 Hs.123073 pkinase, CDC2-related protein kinase 7 3.4
422083 NM_001141 Hs.111256 lipoxygenase,PLAT, arachidonate 15-lipoxygenase, second typ 3.3
411393 AW797437 Hs.69771 SS,sushi,trypsin,vwa,rrm,fibrinogen_C,fn B-factor, properdin 3.3
435767 H73505 Hs.117874 SS,Peptidase_S8,P,Peptidase_S8,P ESTs 3.3
433058 NM_006456 Hs.288215 SS,Pribosyltran, sialyltransferase 3.3
426928 AF037062 Hs.172914 SS,adh_short,TGF-beta,TGFb_propeptide retinol dehydrogenase 5 (11-cis and 9-ci 3.3
414575 H11257 Hs.22968 SS,pkinase,ig, Homo sapiens clone IMAGE:451939, mRNA se 3.3
445941 AI267371 Hs.172636 SS,SS,lipoxygenase,PLAT ESTs 3.3
444542 AI161293 Hs.280380 SS,SS,Peptidase_M1,EGF,ig,lectin_c,sushi aminopeptidase 3.3
425741 AF052152 Hs.159412 pkinase, Homo sapiens clone 24628 mRNA sequence 3.3
434228 Z42047 Hs.283978 SS,TM,7tm_1 Homo sapiens PRO2751 mRNA, complete cds 3.3
433264 D85782 Hs.3229 cysteine dioxygenase, type I 3.3
400419 AF084545 SS,Peptidase_M1, Target 3.3
439750 AL359053 Hs.57664 TM,Integrin_B,Ricin_B_lectin,rrm Homo sapiens mRNA full length insert cDN 3.3
417757 R19897 Hs.106604 death,ZU5,pkinase,Activin_recp, ESTs 3.3
452194 AI694413 Hs.332649 SS,TM,7tm_3,ANF_receptor,sushi olfactory receptor, family 2, subfamily 3.2
421458 NM_003654 Hs.104576 SS carbohydrate (keratan sulfate Gal-6) sul 3.2
443767 BE562136 Hs.9736 SS,PCI,RasGEF,hormone_rec,zf-C4, proteasome (prosome, macropain) 26S subu 3.2
422648 D86983 Hs.118893 peroxidase,LRRCT, Melanoma associated gene 3.2
423431 AA326062 SS,p450,p450 gb:EST29171 Cerebellum II Homo sapiens c 3.2
451254 AI768235 SS,Trehalase gb:wg82g08.x1 Soares_NSF_F8_9W_OT_PA_P_S 3.2
452110 T47667 Hs.28005 SS,TM,Activin_recp,pkinase Homo sapiens cDNA FLJ11309 fis, clone PL 3.2
439963 AW247529 Hs.6793 TM,p450,Ets platelet-activating factor acetylhydrola 3.2
453941 U39817 Hs.36820 SS,DEAD,HRDC,helicase_C, Bloom syndrome 3.1
406664 L34041 Hs.9739 SS,TM,transport_prot,SWIB,RhoGAP,DAG_PE- glycerol-3-phosphate dehydrogenase 1 (so 3.1
453487 R31770 Hs.23540 TM,7tm_1, ESTs 3.1
420911 U77413 Hs.100293 O-linked N-acetylglucosamine (GlcNAc) tr 3.1
443171 BE281128 Hs.9030 SS,TM,7tm_1,rrm,SS TONDU 3.1
452256 AK000933 Hs.28661 TM,GDI,7tm_1, Homo sapiens cDNA FLJ10071 fis, clone HE 3.1
432201 AI538613 Hs.298241 SS,TM,trypsin,SS,TM,trefoil,trypsin,tref Transmembrane protease, serine 3 3.1
419150 T29618 Hs.89640 TM,pkinase,fn3, TEK tyrosine kinase, endothelial (venous 3.1
444443 AI149286 Hs.55099 SS rab6 GTPase activating protein (GAP and 3.1
426283 NM_003937 Hs.169139 kynureninase (L-kynurenine hydrolase) 3.1
436291 BE568452 Hs.5101 SS,abhydrolase, protein regulator of cytokinesis 1 3.1
450223 AA418204 Hs.241493 SS,pro_isomerase, natural killer-tumor recognition sequenc 3.1
424269 AW137691 Hs.1997&4 SS,TM,7tm_2,GPS ESTs 3.1
448105 AW591433 Hs.298241 SS,TM,trefoil,trypsin,trefoil Transmembrane protease, serine 3 3.0
452560 BE077084 Hs.336432 SS,rrm,zf-RanBP,pkinase,C2,pkinase_C,DAG ESTs 3.0
TABLE 21A

Table 21A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 21.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession
420854 197072_1 AW296927 AI684514 AI263168 AA281079
423431 228162_1 AA326062 AA325758 AW962182
423945 233566_1 AA410943 AW948953 AA334202 AA332882
451264 863988_1 AI768235 R31400 H29082 H23107
455325 127M75_1 AW895719 N31451 N41451
456207 165078_1 AA193450
TABLE 21B

Table 21B shows the genomic positioning for those pkeys lacking unigene IDs and
accession numbers in Table 21. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position

401045 6117619 Plus 90044-90164,91111-91345
402230 9986312 Minus 29782-29932
402408 9796239 Minus 110326-110491
402578 9884928 Plus 65350-66496
403593 6862650 Minus 62554-62712,69449-69602
403943 7711864 Plus 100742-100904,101322-101503
404091 7684554 Minus 82121-83229 
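
The Nt_position column above packs one or more predicted exons into a comma-separated list of start-end ranges. The following sketch shows one way such a row could be read; the record type, the function names, and the treatment of coordinates as inclusive on both ends are assumptions for illustration, not something the table states.

```python
from typing import List, NamedTuple, Tuple


class PredictedExons(NamedTuple):  # hypothetical record type
    pkey: int
    ref: str
    strand: str
    exons: List[Tuple[int, int]]


def parse_exon_row(line: str) -> PredictedExons:
    """Split a 'Pkey Ref Strand Nt_position' row into typed fields."""
    pkey, ref, strand, positions = line.split()
    exons = []
    for span in positions.split(","):
        start, end = span.split("-")
        exons.append((int(start), int(end)))
    return PredictedExons(int(pkey), ref, strand, exons)


def total_exon_length(record: PredictedExons) -> int:
    """Sum of exon lengths, assuming inclusive start and end coordinates."""
    return sum(end - start + 1 for start, end in record.exons)


row = parse_exon_row("401045 6117619 Plus 90044-90164,91111-91345")
print(row.exons, total_exon_length(row))
```

Keeping `ref` as a string avoids treating a GI number as a quantity and leaves room for non-numeric sources such as the Dunham et al. reference.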
TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT BREAST

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast.
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the
denominator was the 85th percentile value for 12 non-malignant breast specimens, and the
96th percentile value amongst the 73 breast cancers was greater than or equal to 100 units.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of 90th percentile tumor to 85th percentile normal breast tissue
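
The column layout given in this legend can be read mechanically once a row is clean. The sketch below is illustrative only: the `Row` type and `parse_row` function are hypothetical names, and it assumes whitespace-separated fields, an accession (or `NA`) in the second position, a Unigene ID that starts with `Hs.` when present, and R1 as the last field. Rows that lack an accession entirely would need special handling.

```python
from typing import NamedTuple, Optional


class Row(NamedTuple):  # hypothetical record type
    pkey: int
    exaccn: str
    unigene_id: Optional[str]
    title: str
    r1: float


def parse_row(line: str) -> Row:
    """Parse 'Pkey ExAccn [UnigeneID] Title... R1' into a Row."""
    tokens = line.split()
    pkey, exaccn = int(tokens[0]), tokens[1]
    r1 = float(tokens[-1])
    middle = tokens[2:-1]
    unigene_id = None
    # The Unigene ID is optional; some rows go straight from accession to title.
    if middle and middle[0].startswith("Hs."):
        unigene_id = middle[0]
        middle = middle[1:]
    return Row(pkey, exaccn, unigene_id, " ".join(middle), r1)


print(parse_row("418203 X54942 Hs.83758 CDC28 protein kinase 2 22.6"))
```

Taking R1 from the end of the line rather than by position keeps titles that contain digits, such as "CDC28 protein kinase 2", intact.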



Pkey ExAccn UnigeneID UnigeneTitle R1

400292 AA250737 Hs.72472 BMP-R1B 51.5
424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 38.3
400297 AI127076 Hs.334473 hypothetical protein DKFZp564O1278 29.9
431448 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 26.9
451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 25.8
431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 23.2
418203 X54942 Hs.83758 CDC28 protein kinase 2 22.6
407980 AA046309 gb:zf12f01.s1 Soares_fetal_heart_NbHH19W 19.8
414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 18.9
446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0
409041 AB033025 Hs.50081 KIAA1199 protein 17.6
412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 17.6
407824 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HE 17.1
453160 AI263307 Hs.239884 H2B histone family, member L 17.0
407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1
425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 16.1
438533 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 16.0
428227 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 15.5
444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 15.1
422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.9
430515 AA746503 Hs.283313 ESTs 14.7
417308 H60720 Hs.81892 KIAA0101 gene product 14.4
452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 14.4
412446 AI768015 Hs.92127 ESTs 14.2
415539 AI733881 Hs.72472 BMP-R1B 14.1
435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 13.8
438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8
400205 NA NM_006265*:Homo sapiens RAD21 (S. pombe) 13.5
430965 AA489732 Hs.154918 ESTs 13.4
415263 AA948033 Hs.130853 ESTs 13.3
451952 AL120173 Hs.301663 ESTs 13.2
449722 BE280074 Hs.23960 cyclin B1 13^
406685 M18728 gb:Human nonspecific crossreacting antig 13.0
406690 M29540 Hs.220529 carcinoembryonic antigen-related cell ad 12.8
429925 NM_000786 Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8
416498 U33632 Hs.79351 potassium channel, subfamily K, member 1 12.7
432378 AI493046 Hs.146133 ESTs 12.5
441377 BE218239 Hs.202656 ESTs 12.5
456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 12.4
422805 AA436989 Hs.121017 H2A histone family, member A 12.2
407811 AW190902 Hs.40098 cysteine knot superfamily 1, BMP antagon 12.2
407178 AA195651 Hs.104106 ESTs 12.2
420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 12.1
421727 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 12.0
434408 AI031771 Hs.132586 ESTs 12.0
446591 H44186 Hs.15456 PDZ domain containing 1 11.9
431385 BE178536 Hs.11090 membrane-spanning 4-domains, subfamily A 11.8
443348 AW873596 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 11.7
416602 NM_006159 Hs.79389 nel (chicken)-like 2 11.7
433365 AF026944 Hs.293797 ESTs 11.6
437866 AA156781 Hs.74170 metallothionein 1E (functional) 11.5
412472 AW975398 Hs.293836 ESTs 11.4
416030 H15261 Hs.21948 ESTs 1U
439979 AW600291 Hs.6823 hypothetical protein FLJ10430 1U
420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 11.3
411598 BE336654 Hs.70937 H3 histone family, member A 11.2
423600 AI633559 Hs.310359 ESTs 11.2
430770 AA765694 Hs.123296 ESTs 11.0
421037 AI684608 Hs.197653 programmed cell death 9 (PDCD9) 10.9
452461 N78223 Hs.108106 transcription factor 10.7
409269 AA576953 Hs.22972 hypothetical protein FLJ13352 10.6
417791 AW965339 Hs.111471 ESTs 10.6
447268 AI370413 Hs.36563 hypothetical protein FLJ22418 10.4
424001 W67883 Hs.137476 paternally expressed 10 10.4
447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4
424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1
453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 10.1
442942 AW167087 Hs.131562 ESTs 10.1
434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ11382 fis, clone HE 10.1
427217 AA399272 Hs.144341 ESTs 10.1
445730 AI624342 Hs.170042 ESTs 10.0
432887 AI926047 Hs.162859 ESTs 10.0
452243 AL355715 Hs.28555 programmed cell death 9 9.9
424590 AW966399 Hs.46821 hypothetical protein FLJ20086 9.9
432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9
438950 H23789 Hs.144530 EST 9.9
418836 AI655499 Hs.161712 ESTs 9.8
430291 AV660345 Hs.238126 CGI-49 protein 9.8
444665 BE613126 Hs.47783 B aggressive lymphoma gene 9.7
407377 C16391 gb:C16391 Clontech human aorta polyA mRN 9.7
445413 AA151342 Hs.12677 CGI-147 protein 9.7
443462 AI064690 Hs.171176 ESTs 9.7
442145 AI022650 Hs.8117 erbb2-interacting protein ERBIN 9.7
435570 AF212222 Hs.177812 uncharacterized bone marrow protein BM04 9.7
439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6
428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 9.6
449448 D60730 Hs.57471 ESTs 9.6
433929 AI375499 Hs.27379 ESTs 9.5
432731 R31178 Hs.287820 fibronectin 1 9.3
411815 AA156679 Hs.125790 leucine-rich repeat-containing 2 9.3
415385 R17798 Hs.7535 COBW-like protein 9.3
422026 U80736 Hs.110826 trinucleotide repeat containing 9 9.2
432596 AJ224741 Hs.278461 matrilin 3 9.2
439451 AF086270 Hs.278554 heterochromatin-like protein 1 9.2
423945 AA410943 gb:zt32h03.r1 Soares ovary tumor NbHOT H 9.1
442432 BE093589 Hs.38178 hypothetical protein FLJ23468 9.1
446715 AI337735 Hs.173919 ESTs, Moderately similar to ZN91_HUMAN Z 9.0
408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0
437021 AI076089 Hs.292239 ESTs 9.0
428479 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 8.9
428839 AI767756 Hs.82302 Homo sapiens cDNA FLJ14814 fis, clone NT 8.9
402408 NA NM_030920*:Homo sapiens hypothetical pro 8.9
418601 AA279490 Hs.86368 calmegin 8.8
426327 W03242 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.8
419519 AI198719 Hs.176376 ESTs 8.8
440621 AW296024 Hs.150434 ESTs 8.8
446142 AI754693 Hs.145968 ESTs 8.8
418196 AI745649 Hs.26549 KIAA1708 protein 8.7
447178 AW594641 Hs.192417 ESTs 8.7
427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 8.6
415857 AA866115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 8.6
435061 AI651474 Hs.163944 ESTs 8.6
431374 BE258532 Hs.251871 CTP synthase 8.4
417866 AW067903 Hs.82772 collagen, type XI, alpha 1 8.4
437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8.3
437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A 8.3
423887 AL080207 Hs.134585 DKFZP434G232 protein 8.2
440941 BE268362 Hs.7535 COBW-like protein 8.2
429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.2
410193 AJ132592 Hs.59757 zinc finger protein 281 8.2
431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 8.1
446258 AI283476 Hs.263478 ESTs 8.1
416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.1
434424 AI811202 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 8.1
421650 AA781795 Hs.122587 ESTs 8.0
429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 8.0
457465 AW301344 Hs.122908 DNA replication factor 8.0
427961 AW293165 Hs.143134 ESTs 8.0
436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 8.0
418216 AA662240 Hs.283099 AF15q14 protein 8.0
418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9
400285 NA Eos Control 7.9
401464 AF039241 Hs.9028 histone deacetylase 5 7.9
407242 M18728 gb:Human nonspecific crossreacting antig 7.8
422232 D43945 Hs.113274 transcription factor EC 7.8
454024 AA993527 Hs.293907 hypothetical protein FLJ23403 7.8
444542 AI161293 Hs.280380 aminopeptidase 7.8
436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 7.7
437204 AL110216 Hs.12285 ESTs, Weakly similar to I55214 salivary 7.6
408805 H69912 Hs.48269 vaccinia related kinase 1 7.6
437207 T27503 Hs.15929 hypothetical protein FLJ12910 7.6
442818 AK001741 Hs.8739 hypothetical protein FLJ10879 7.6
426283 NM_003937 Hs.169139 kynureninase (L-kynurenine hydrolase) 7.5
424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 7.5
446315 NM_016293 Hs.14770 bridging integrator 2 7.5
433426 H69125 Hs.133525 ESTs 7.5
406639 M97711 gb:Human T-cell receptor (V beta 18.1, J 7.5
420077 AW512260 Hs.87767 ESTs 7.4
457332 AA961694 Hs.105187 kinesin protein 9 gene 7.4
422938 NM_001809 Hs.1594 centromere protein A (17kD) 7.4
447555 AI391662 Hs.160963 Homo sapiens, clone MGC:12318, mRNA, com 7.4
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATIO 7.3
410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3
400268 NA NM_003292:Homo sapiens translocated prom 7.3
439509 AF086332 Hs.58314 ESTs 7.3
407771 AL138272 Hs.62713 ESTs 7.3
407202 N58172 Hs.109370 ESTs 7.3
433096 AU076803 Hs.282975 carboxylesterase 2 (intestine, liver) 7.2
422094 AF129535 Hs.272027 F-box only protein 5 7.1
430832 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 7.1
430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE 7.0
423739 AA398155 Hs.97600 ESTs 7.0
448212 AI475858 gb:tc87d07.x1 NCI_CGAP_CLL1 Homo sapiens 7.0
407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0
454440 BE062906 Hs.28338 KIAA1546 protein 7.0
444783 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 7.0
421373 AA808229 Hs.167771 ESTs 6.9
431960 AW241821 Hs.301927 c6.1A 6.9
424704 AI263293 Hs.152096 cytochrome P450, subfamily IIJ (arachido 6.8
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8
439840 AW449211 Hs.105445 GDNF family receptor alpha 1 6.8
414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8
441243 AI767056 Hs.193002 ESTs 6.7
408380 AF123050 Hs.44532 diubiquitin 6.7
422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.7
446651 AA393907 Hs.97179 ESTs 6.7
419839 U24577 Hs.93304 phospholipase A2, group VII (platelet-ac 6.7
437740 AA810265 Hs.122915 ESTs 6.7
421582 AI910275 Hs.1406 trefoil factor 1 (pS2) 6.7
427356 AW023482 Hs.97849 ESTs 6.6
429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 6.6
422634 NM_016010 Hs.118821 CGI-62 protein 6.6
421072 AI215069 Hs.89113 ESTs 6.5
427718 AI798680 Hs.25933 ESTs 6.5
411000 N40449 Hs.201619 ESTs, Weakly similar to S38383 SEB4B pro 6.5
449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.4
409757 NM_001898 Hs.123114 cystatin SN 6.4
447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4
456938 X52509 Hs.161640 tyrosine aminotransferase 6.4
418848 AI820961 Hs.193465 ESTs 6.4
424902 NM_003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4
452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4
439452 AA918317 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro 6.4
407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6.3
411078 AI222020 Hs.182364 CocoaCrisp 6.3
433001 AF217513 Hs.279905 clone HQ0310 PRO0310p1 6.3
434340 AI193043 Hs.128685 ESTs, Weakly similar to T17226 hypotheti 6.2
429503 AA394183 HsjJ6873 ESTs 6.2
402578 C1001134:gi|2117372|pir||I65981 fatty ac 6.2
409646 AW161391 Hs.709 deoxycytidine kinase 6.1
430447 W17064 Hs.332848 SWI/SNF related, matrix associated, acti 6.1
432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1
443709 AI082692 Hs.134662 ESTs 6.1
420929 AI694143 Hs.296251 programmed cell death 4 6.1
428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 6.1
428248 AI126772 Hs.40479 ESTs 6.0
420344 BE463721 Hs.97101 putative G protein-coupled receptor 6.0
453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 6.0
425397 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 6.0
418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1; inters 6.0
35 428585 AB007863 Hs.185140 KIAAa403 protein 6.0 

437608 AA761605 Hs.292308 ESTs, Weakly similar to ALUIJ^UMAN ALU S6.0 

427408 AA583206 Hs,2156 RAR-related orphan receptor A 6,0 

406687 M31 126 Hs.272620 matrix metalloprotelnase 1 1 (MMP1 1; stro 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

40 447051 AW139130 Hs.160951 ESTs. Weakly similar to Coni [H.sapiens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metalk)proteinase 13 (collagenase 6.0 

435106 AA100847 Hs.193380 ESTs, Highly similarlo AF174600 1 F-box 5.9 

435525 A1831297 Hs.123310 ESTs 5.9 

45 458809 AW972512 Hs.20985 sin3^ociated potypeplide. 30kD 5.9 

410785 AWd03341 gb:IL2-UM0079^9O300^O-DO3 UMX79 Homo5.9 

422576 BE548555 Hs.1 18554 CGI-83 protein 5.9 

451398 AI793124 Hs.144479 ESTs 5.9 

441881 AW968904 Hs.179566 hypothetical protein FU22624 5.8 

50 412022 AI005043 Hs.24143 Wiskott-AWrich syndrome protein interac 5.8 

416636 N32536 Hs.42645 solute earner family 16 (monocarboxyBc 5.8 

447350 AI375572 Hs.172634 ESTs 5.8 

434094 AA305599 Hs.238205 hypothefical protein PRO20 13 5,8 

409151 AA306105 Hs.50785 SEC22. veside trafficking protein (S. c 5.8 

5 5 448807 AI571940 Hs.7549 ESTs 5.8 

452281 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 fis, done PL 5.8 

421281 AI299139 Hs.17517 ESTs 5.8 

430361 AI033965 Hs.239926 stBrol-C4-methyl oxidase-like 5,8 

400289 X07820 Hs.2258 matrix metaltoproteinase 10 (MMP10; str 5.7 

60 440527 AV657117 Hs.184164 ESTs, Moderately simitartoS65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.169300 transfonrilng growth factor, beta 2 5,7 

452401 NM..007115HS.29352 tumor necrosis factor, alpha-induced pro 5.7 

448663 BE614599 Hs.106823 hypothetical protein MGC14797 5.7 

65 438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.183752 microseminoprotein, beta- 5.6
430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6
422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 5.6
444758 AL044878 Hs.11899 3-hydroxy-3-methylglutaryl-Coenzyme A re 5.6
443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6
400301 X03635 Hs.1657 estrogen receptor 1 5.6
447078 AW885727 Hs.301570 ESTs 5.6
432015 AL157504 Hs.159115 Homo sapiens mRNA; cDNA DKFZp586O0724 (f 5.5
438691 AA906288 Hs.212184 ESTs 5.5
439809 R41396 Hs.101774 hypothetical protein FLJ23045 5.5
415786 AW419196 Hs.257924 hypothetical protein FLJ13782 5.5
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5.5
401645 NA C16001440^i|12330704|gb|AAG52890.1|AF3 5.5
437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5.5
445885 AI734009 Hs.127699 KIAA1603 protein 5.4
439138 AI742605 Hs.193696 ESTs 5.4
440270 NM_015986 Hs.7120 cytokine receptor-like molecule 9 5.4
437536 X91221 Hs.144465 ESTs 5.4
438167 R28363 Hs.24286 ESTs 5.4
452741 BE392914 Hs.30503 Homo sapiens cDNA FLJ11344 fis, clone PL 5.4
426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 5.4
413554 AA319146 Hs.75426 secretogranin II (chromogranin C) 5.4
422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.4
434263 N34895 Hs.44648 ESTs 5.4
446382 AW205168 Hs.150823 ESTs 5.4
422406 AF025441 Hs.116206 Opa-interacting protein 5 5.3
438321 AA576635 Hs.6153 CGM8 protein 5.3
418310 AA814100 Hs.86693 ESTs 5.3
419625 U91616 Hs.91640 nuclear factor of kappa light polypeptid 5.3
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.3

445900 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 5.2
449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 5.2
418758 AW959311 Hs.172012 hypothetical protein DKFZp434J037 5.2
431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2
417079 U65590 Hs.81134 interleukin 1 receptor antagonist 5.2
421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 5.2
428804 AK000713 Hs.193736 hypothetical protein FLJ20706 5.2
427427 AF077345 Hs.177936 ESTs 5.2
403485 C3001813*:gi|12737279|ref|XP_012163.1|k 5.2
422168 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 5.1
421937 AI878857 Hs.109706 hematological and neurological expressed 5.1
426752 X69490 Hs.172004 titin 5.1
453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.1
423198 M81933 Hs.1634 cell division cycle 25A 5.1
412281 AI810054 Hs.14119 ESTs 5.1
447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 5.1
453931 AL121278 Hs.25144 ESTs 5.1
404347 Target Exon 5.1
431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 5.1
429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 5.1
436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1
450603 R43646 Hs.12422 ESTs 5.1
434725 AK000796 Hs.4104 hypothetical protein 5.0
435981 H74319 Hs.188620 ESTs 5.0
407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF_HUMAN !!!! 5.0
431689 AA305688 Hs.267695 UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 5.0
405348 NA C7001664:gi|12698061|dbj|BAB21849.1| (AB 5.0
436196 AK001084 Hs.333498 Homo sapiens cDNA FLJ10222 fis, clone HE 5.0
437065 AIX36450 Hs.103238 ESTs 5.0
410196 AI936442 Hs.59838 hypothetical protein FLJ10808 5.0
429412 NM_006235 Hs.2407 POU domain, class 2, associating factor 5.0
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4.9
403329 NA Target Exon 4.9
442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ 4.9
442441 AI820662 Hs.129598 ESTs 4.9
430375 AW371048 Hs.93758 H4 histone family, member H 4.9



424128 AW966163 gb:EST378236 MAGE resequences, MAGI Homo 4.9
408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 4.9
407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.9
432606 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 4.9
453204 R10799 Hs.191990 ESTs 4.8
452020 AA722012 Hs.255757 ESTs, Weakly similar to AT2A_HUMAN POTEN 4.8
449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 4.8
408369 R38438 Hs.182575 solute carrier family 15 (H??? transport 4.8
431645 AF078849 Hs.266483 dynein light chain-A 4.8
423575 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 4.8
444246 H93281 Hs.10710 hypothetical protein FLJ20417 4.8
421524 AA312082 Hs.105445 GDNF family receptor alpha 1 4.8
452827 AI571835 Hs.55468 ESTs 4.8
414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8
456086 AL161999 Hs.77324 eukaryotic translation termination facto 4.8
419078 M93119 Hs.89584 insulinoma-associated 1 4.8
418973 AA233056 Hs.191518 ESTs 4.8
447033 AI357412 Hs.157601 ESTs 4.8
451621 AI879146 Hs.26770 fatty acid binding protein 7, brain 4.7
419968 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 4.7
424326 NM_014479 Hs.145296 disintegrin protease 4.7
431585 BE242803 Hs.262823 hypothetical protein FLJ10326 4.7
429294 AA095971 Hs.198793 Homo sapiens cDNA: FLJ22463 fis, clone H 4.7
416814 AW192307 Hs.80042 dolichyl-P-Glc:Man9GlcNAc2-PP-dolichylgl 4.7
439897 NM_015310 Hs.6763 KIAA0942 protein 4.7
429687 AI675749 Hs.211608 nucleoporin 153kD 4.7
422880 AF228704 Hs.121524 glutathione reductase 4.7
405801 NM_000390:Homo sapiens choroideremia (Ra 4.6

432435 BE218886 Hs.282070 ESTs 4.6
439544 W26354 Hs.28891 hypothetical protein FLJ11350; artemis p 4.6
425354 U62027 Hs.155935 complement component 3a receptor 1 4.6
436027 AI864053 Hs.39972 ESTs, Weakly similar to I38588 reverse t 4.6
424623 AW963082 Hs.337404 ESTs 4.6
403366 NA Target Exon 4.6
402542 Target Exon 4.6
450193 AI916071 Hs.15607 Homo sapiens Fanconi anemia complementat 4.6
411678 AI907114 Hs.71465 squalene epoxidase 4.6
456844 AI264155 Hs.152981 CDP-diacylglycerol synthase (phosphatida 4.6
448072 AI459306 Hs.24908 ESTs 4.5
408045 AW138959 Hsi45123 ESTs 4.5
423782 AI472209 Hs.323117 ESTs 4.5
447388 AW630534 Hs.76277 Homo sapiens, clone MGC:9381, mRNA, comp 4.5
448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5
452561 AI692181 Hs.49169 KIAA1634 protein 4.5
425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo 4.5
428801 AW277121 Hs.254881 ESTs 4.5
428500 AI815395 Hs.184641 fatty acid desaturase 2 4.5
426075 AW513691 Hs.270149 ESTs, Weakly similar to 2109260A B cell 4.4
437259 AI377755 Hs.120695 ESTs 4.4
400409 AF153341 Hs.283954 Homo sapiens winged helix/forkhead trans 4.4
412863 AA121673 Hs.59757 zinc finger protein 281 4.4
426989 AI815206 Hs.99395 ESTs 4.4
401866 Target Exon 4.4
418819 AA228776 Hs.191721 ESTs 4.4
406348 Target Exon 4.4
412138 AW895387 gb:QV4-NN003a^00300-157-c10NN0038Homo 4.4
428550 AW297880 Hs.98661 ESTs 4.4
411743 AW852214 gb:QV4^T0361-301299^74-b05CT0361 Homo 4.4
429966 BE081342 Hs.283037 HSPC039 protein 4.4
423291 NM_004129 Hs.126590 guanylate cyclase 1, soluble, beta 2 4.4
423456 AL110151 Hs.128797 DKFZP586D0824 protein 4.4
452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 4.4

424871 NM_004525 Hs.153595 low density lipoprotein-related protein 4.3
429576 AA706003 Hs.99387 ESTs 4.3
429922 Z97630 Hs.226117 H1 histone family, member 0 4.3
421379 Y15221 Hs.103982 small inducible cytokine subfamily B (Cy 4.3
400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 4.3
437258 AIJ041243 Hs.174104 ESTs 4.3
446595 T57448 Hs.15467 hypothetical protein FLJ20725 4.3
403011 NA ENSP00000215330*:Probable serine/threoni 4.3
419055 AI365384 Hs.11571 Homo sapiens cDNA FLJ11570 fis, clone HE 4.3
418661 NM_001949 Hs.1189 E2F transcription factor 3 4.3
407786 AA687538 Hs.38972 tetraspan 1 4.3
429183 AB014604 Hs.197955 KIAA0704 protein 4.3
442914 AW188551 Hs.99519 hypothetical protein FLJ14007 4.3
441029 AI091795 Hs.179246 ESTs 4.3
452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3
414821 M63835 Hs.77424 Fc fragment of IgG, high affinity Ia, re 4.2
410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, clone HE 4.2
452110 T47667 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PL 4.2
442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2
417318 AW953937 Hs.12891 ESTs 4.2
431818 AW510444 Hs.191705 ESTs, Weakly similar to T47184 hypotheti 4.2
443646 AI085198 Hs.164226 ESTs 4.2
419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.2
446839 BE091926 Hs.16244 mitotic spindle coiled-coil related prot 4.2
423242 AL039402 Hs.125783 DEME-6 protein 4.2
432116 AA902953 Hs.308538 ESTs 4.2
409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 4.2
445625 BE246743 Hs.288529 hypothetical protein FLJ22635
425139 AW630488 Hs.325820 protease, serine, 23 4.2
447397 BE247676 Hs.18442 E-1 enzyme 4.2

410166 AK001376 Hs.59346 hypothetical protein FLJ10514 4.1
437295 AW779318 Hs.88417 ESTs 4.1
430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1
441790 AW2g490g Hs.132208 ESTs 4.1
410129 BE244074 HS.58&31 regulator of Fas-induced apoptosis 4.1
427521 AW973352 Hs.290585 ESTs 4.1
425247 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stro 4.1
412886 AF041163 Hs.74647 Human T-cell receptor active alpha-chain 4.1
441153 BE562826 gb:601336534F1 NIH_MGC_44 Homo sapiens c 4.1
444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1
426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 4.1
405850 NA Target Exon 4.1
440283 AI732892 Hs.190489 ESTs 4.0
432441 AW292425 Hs.163484 ESTs 4.0
400284 NA estrogen receptor 1 4.0
417341 N91453 Hs.102987 ESTs 4.0
429732 U20158 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 4.0
411393 AW797437 Hs.69771 B-factor, properdin 4.0
425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0
419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0
419092 J05581 Hs.89603 mucin 1, transmembrane 4.0
443147 AI034351 Hs.19030 ESTs 4.0
408633 AW963372 Hs.46677 PRO2000 protein 4.0
433404 T32982 Hs.102720 ESTs 4.0
421506 BE302796 Hs.105097 thymidine kinase 1, soluble 4.0

417900 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevi 3.9
414602 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 3.9
413762 AW411479 Hs.a46 FK506-binding protein 4 (59kD) 3.9
404580 NM_014112*:Homo sapiens trichorhinophala 3.9
452046 AB018345 Hs.27657 KIAA0802 protein 3.9
459587 AA031956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 3.9
416668 U03272 Hs.79432 fibrillin 2 (congenital contractural ara 3.9
426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9
429353 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 3.9
419038 AW134924 Hs.190325 ESTs 3.9
418918 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 3.9
421977 W94197 Hs.110165 ribosomal protein L26 homolog 3.9
442567 AI201183 Hs.130251 ESTs 3.9
421168 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9
431701 AW935490 Hs.14658 Human chromosome 5q13.1 clone 5G8 mRNA 3.9
418526 BE019020 Hs.85838 solute carrier family 16 (monocarboxylic 3.9
414998 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 3.9
422790 AA809875 Hs.25933 ESTs 3.9
419741 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 3.9
430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3.9
458814 AI498957 Hs.170861 ESTs, Weakly similar to Z195_HUMAN ZINC 3.8
428514 AW236861 Hs.193139 ESTs 3.8
434521 NM_002267 Hs.3886 karyopherin alpha 3 (Importin alpha 4) 3.8
409425 U40462 Hs.54452 zinc finger protein, subfamily 1A, 1 (Ik 3.8
439560 BE565647 Hs.74899 hypothetical protein FLJ12820 3.8
424028 AF055084 Hs.153692 Homo sapiens cDNA FLJ14354 fis, clone Y7 3.8
400021 AFFX control. HUMISGF3A/M97935_MA 3.8
453403 BE466639 Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 3.8
445941 AI267371 Hs.172636 ESTs 3.8
434378 AA631739 Hs.335440 EST 3.8
429220 AW207208 Hs.1363ig ESTs 3.8
439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 line-1 pr 3.8
401045 C11001883*:gi|6753278|ref|NP_033938.1|c 3.8
430178 AW449612 Hs.152475 ESTs 3.8
423397 NM_001838 Hs.1652 chemokine (C-C motif) receptor 7 3.8
447630 AI660149 Hs.44865 lymphoid enhancer-binding factor 1 3.8
436391 AJ227892 Hs.146274 ESTs 3.8
413011 AW068115 Hs.821 biglycan 3.8
422121 AI767949 Hs.179833 ESTs 3.8
452268 NM_003512 Hs.28777 H2A histone family, member L 3.8
427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 3.8
415579 AA165232 Hs.222069 ESTs 3.8
437330 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 3.8
427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 3.7

400286 NA C16000922:gi|7499103|pir||T20903 hypothe 3.7
420281 AI623693 Hs.191533 ESTs 3.7
419926 AW900992 Hs.g3796 DKFZP586D2223 protein 3.7
417541 AI992191 Hs.180040 hypothetical protein FLJ22439 3.7
426172 AA371307 Hs.125056 ESTs 3.7
429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 3.7
457001 J03258 Hs.2062 vitamin D (1,25- dihydroxyvitamin D3) re 3.7
424109 AW406878 gb:UI"HF-BLO-adg-^-06-04Jl.r1 NIH_MGC_37 3.7
417022 NM_014737 Hs.80905 Ras association (RalGDS/AF-6) domain fam 3.7
435222 AI208737 Hs.122810 Homo sapiens cDNA FLJ11489 fis, clone HE 3.7
430448 AI633553 Hs.13303 Homo sapiens cDNA: FLJ21784 fis, clone H 3.7
432729 AK000292 Hs.278732 hypothetical protein FLJ20285 3.7
413915 rJ49813 Hs.75B15 apolipoprotein C-II 3.7
421662 NM_014141 Hs.106552 cell recognition molecule Caspr2 3.7
441633 AW958544 Hs.112242 normal mucosa of esophagus specific 1 3.7
408761 AA057264 Hs.238936 ESTs, Weakly similar to (defline not ava 3.7
406153 Target Exon 3.7
445563 AW873606 Hs.149006 ESTs 3.7
453464 AI884911 Hs^2989 receptor (calcitonin) activity modifying 3.7
448918 AB011152 Hs.22572 KIAA0580 protein 3.7

413936 AP1 13676 Hs.297681 serine (or cysteine) proteinase inhibito 3.6
448069 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.6
453313 BE005771 Hs.153746 hypothetical protein FLJ22490 3.6
425234 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 3.6
419941 X98654 Hs.93837 phosphatidylinositol transfer protein, m 3.6
402397 AF188625 Hs.189507 phospholipase A2, group IID 3.6
430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 3.6
448106 AI800470 Hs.171941 ESTs 3.6
426431 NM_000458 Hs.169853 transcription factor 2, hepatic; LF-B3; 3.6
431843 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 3.5
426878 BE069341 gb:QV3-BT0381-27010CM)73^«8BT0381 Homo 3.6
434061 AW024973 Hs.283675 NPD009 protein 3.6
445292 AV653264 Hs.13982 Homo sapiens cDNA FLJ14666 fis, clone NT 3.6
452101 T60298 Hs.10844 Homo sapiens cDNA FLJ14476 fis, clone MA 3.6

427581 NM_014788 Hs.179703 KIAA0129 gene product 3.6
409047 AW961434 Hs.31539 ESTs 3.6
416820 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 3.6
410386 W26187 Hs.3327 Homo sapiens cDNA: FLJ22219 fis, clone H 3.6
440516 S42303 Hs.161 cadherin 2, type 1, N-cadherin (neuronal 3.6
434360 AW015415 Hs.127780 ESTs 3.6
428970 BE276891 Hs.194691 retinoic acid induced 3 3.6
415079 R43179 Hs.22895 hypothetical protein FLJ23546 3.6
409619 AK001015 Hs.55220 BCL2-associated athanogene 2 3.6
430044 AA464510 Hs.152812 ESTs 3.6
430829 AW451999 Hs.194024 ESTs 3.6
434224 AA380731 Hs.84 interleukin 2 receptor, gamma (severe co 3.6
439247 AF088020 Hs.46767 EST 3.6

431542 H63010 Hs.5740 ESTs 3.5
430713 AA351647 Hs.2642 eukaryotic translation elongation factor 3.5
434988 AI418055 Hs.161180 ESTs 3.5
437748 AF234882 Hs.5814 suppression of tumorigenicity 7 3.5
418322 AA284156 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 3.5
439569 AW602166 Hs.222399 CEGP1 protein 3.5
459583 AI907673 gb:IL-BT152-080399O04BT152Homosapien 3.5
403212 NM_019595:Homo sapiens intersectin 2 (IT 3.5
409099 AK000725 Hs.50579 hypothetical protein FLJ20718 3.5
453968 AA847643 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 3.5
436338 W92147 Hs.118394 ESTs 3.5
422890 Z43784 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G) 3.5
442295 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 3.5
417975 AA641838 Hs.30085 hypothetical protein FLJ23186 3.5
433730 AK002135 Hs.3542 hypothetical protein FLJ11273 3.5
439926 AW014875 Hs.137007 ESTs 3.5

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-l 3.4
410153 BE311926 Hs.15830 hypothetical protein FLJ12691 3.4
422128 AW881145 gb:QVO-OT0033-010400-182-a07 0T0033Homo 3.4
414921 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.4
441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 3.4
444564 AI167877 Hs.143716 ESTs 3.4
402470 NA Target Exon 3.4
418120 AA213437 Hs.192249 ESTs 3.4
422414 AW875237 Hs.13701 ESTs 3.4
433345 AI681545 Hs.152982 hypothetical protein FLJ13117 3.4
409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4
422611 AA158177 Hs.118722 fucosyltransferase 8 (alpha (1,6) fucosy 3.4
423554 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 3.4
442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 prote 3.4
402359 C1900199r.gi|12656111|gb|AAKD0751.1|AF2 3.4
439398 AA284267 Hs.221504 ESTs 3.4
415208 F01020 Hs.172004 titin 3.4
452853 AA812633 Hs.10845 ESTs 3.4
429345 R11141 Hs.199695 hypothetical protein 3.4
449027 AJ271216 Hs.22880 dipeptidylpeptidase III 3.4
412115 AK001763 Hs.73239 hypothetical protein FLJ10901 3.4
432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 3.4
428977 AK001404 Hs.194698 cyclin B2 3.4
431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4
418286 AA622528 Hs.319825 Homo sapiens, clone IMAGE:3616574, mRNA, 3.4
436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3.4
443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4

428450 NM_014791 Hs.184339 KIAA0175 gene product 3.3
449571 AW016812 Hs.200266 ESTs 3.3
412777 AI335773 Hs.270123 ESTs 3.3
420542 NM_000505 Hs.1321 coagulation factor XII (Hageman factor) 3.3
412754 AW160375 Hs.74565 amyloid beta (A4) precursor-like protein 3.3
418327 U70370 Hs.84135 paired-like homeodomain transcription fa 3.3
449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3
425999 AW513051 Hs.332981 ESTs, Weakly similar to I38022 hypotheti 3.3
430280 AA361258 Hs.237868 Interleukin 7 receptor 3.3
407777 AA161071 Hs.71465 squalene epoxidase 3.3
426516 BE262660 Hs.170197 glutamic-oxaloacetic transaminase 2, mit 3.3
414361 AI086138 Hs.204044 ESTs 3.3
427080 AW068287 Hs.173466 ras-related C3 botulinum toxin substrate 3.3
426429 X73114 Hs.169849 myosin-binding protein C, slow-type 3.3
446163 AA026880 Hs.25252 prolactin receptor 3.3
428566 U41763 Hs.184916 clathrin, heavy polypeptide-like 1 3.3
418641 BE243136 Hs.86947 a disintegrin and metalloproteinase doma 3.3
436293 AI601188 Hs.120910 ESTs 3.3
411257 AA628967 Hs.115274 ESTs, Highly similar to IHH_HUMAN INDIAN 3.3
430253 AK001514 Hs.236844 hypothetical protein FLJ10652 3.3
430066 AI929659 Hs.237625 signal recognition particle 72kD 3.3
436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.3
437786 BE142681 Hs.155573 polymerase (DNA directed), eta 3.3
444079 H09048 Hs.23606 ESTs 3.3
457183 H91882 Hs.118569 Dvl-binding protein IDAX (inhibition of 3.3
431215 AA496078 Hs.121554 Human DNA sequence from clone RP11-218C1 3.3
424563 AA446932 Hs.151428 ret finger protein 2 3.3
450828 AW270655 Hs.193804 ESTs 3.3

408652 R43409 Hs.5829 Homo sapiens mRNA for KIAA1644 protein, 3.3
445142 AW978484 Hs.93842 Homo sapiens cDNA: FLJ22554 fis, clone H 3.3
426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp58612022 (f 3.3
439237 AW408158 Hs.318893 ESTs, Weakly similar to A47582 B-cell gr 3.3
422616 BE300330 Hs.118725 selenophosphate synthetase 2 3.3
443247 BE614387 Hs.333893 c-Myc target JPO1 3.3
406663 U24683 Hs.302063 immunoglobulin heavy constant mu 3.3
434137 AA907734 Hs.124895 ESTs 3.3
408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 3.3
439101 C01765 Hs.38750 hypothetical protein FLJ11526 3.3
408221 AA912183 Hs.47447 ESTs 3.3
447519 U46258 Hs.339665 ESTs 3.3
404755 NA Target Exon 3.3

451871 AI821005 Hs.118599 ESTs 3.2
420319 AW406289 Hs.96593 hypothetical protein 3.2
430580 AA806105 Hs.300697 immunoglobulin heavy constant gamma 3 (G 3.2
400202 NA NM_002795*:Homo sapiens proteasome (pros 3.2
400222 NA NM_002082*:Homo sapiens G protein-couple 3.2
425988 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypotheti 3.2
458098 BE550224 Hs.74170 metallothionein 1E (functional) 3.2
430588 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 3.2
431563 AI027643 Hs.120912 ESTs 3.2
442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.2
422309 U79745 Hs.114924 solute carrier family 16 (monocarboxylic 3.2
419703 AI793257 Hs.128151 ESTs 3.2
420380 AA640891 Hs.102406 ESTs 3.2
410853 H04588 Hs.30469 ESTs 3.2
454417 AI244459 Hs.110826 trinucleotide repeat containing 9 3.2
432745 AI821926 gb:nt78f05j(5 NCLCGAP^PrS Homo sapiens 3.2
422032 AA476966 Hs.110857 polymerase (RNA) III (DNA directed) poly 3.2
415339 NM_015156 Hs.78398 KIAA0071 protein 3.2
426384 AI472078 Hs.303562 ESTs 3.2
448030 N30714 Hs.325960 membrane-spanning 4-domains, subfamily A 3.2
418739 AA310964 Hs.68012 SHP2 interacting transmembrane adaptor 3.2
442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G16 3.2

434747 AA837085 Hs.220585 ESTs 3.2
427297 AW292593 Hs.334g07 Homo sapiens, clone MGC:17333, mRNA, com 3.2
412228 AW503785 Hs.73792 complement component (3d/Epstein Barr vi 3.2
452304 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.2
453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2
407758 D50915 Hs.38365 KIAA0125 gene product 3.2
451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2
430015 AW768399 Hs.112157 ESTs 3.2
433313 W20128 Hs.296039 ESTs 3.2
418334 AA319233 Hs.5521 ESTs 3.2
450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2
454365 AW966728 Hs.54642 methionine adenosyltransferase II, beta 3.2
451128 AL118668 gb:DKFZp761l0310_r1 761 (synonym: hamy2) 3.2
417793 AW405434 Hs.82575 small nuclear ribonucleoprotein polypept 3.2
428027 U22029 Hs.334345 cytochrome P450, subfamily IIA (phenobar 3.2
441197 BE244638 Hs.166 sterol regulatory element binding transc 3.2
424634 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 3.2
419986 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2
416714 AF283770 Hs.79630 CD79A antigen (immunoglobulin-associated 3.2
449465 NM_004380 Hs.23598 CREB binding protein (Rubinstein-Taybi s 3.2
422166 W72424 Hs.112405 S100 calcium-binding protein A9 (calgran 3.2
409079 W87707 Hs.82065 interleukin 6 signal transducer (gp130, 3.2
423551 AA327598 Hs.233785 ESTs 3.2
453553 AA036849 Hs.61629 Homo sapiens cDNA FLJ12763 fis, clone NT 3.2

442580 AI733682 Hs.130239 ESTs 3.2
458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 3.2
425700 AF076292 Hs.159251 forkhead box H1 3.2
417124 BE122762 Hs.25338 ESTs 3.2
407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2
442215 AI703172 Hs.129005 ESTs, Weakly similar to 2109260A B cell 3.1
430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1
425317 AW205118 Hs.210546 Interleukin 21 receptor 3.1
426095 AI278023 Hs.89986 ESTs 3.1
442313 BE388898 Hs.8215 hypothetical protein FLJ11307 3.1
424709 AL137589 Hs.152149 hypothetical protein DKFZp434K0410 3.1
429671 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1
432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1074 protein 3.1
431574 AW572659 Hs.261373 hypothetical protein dJ434014.3 3.1
436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1
405017 NA Target Exon 3.1
433805 AA706910 Hs.112742 ESTs 3.1
437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1
430105 X70297 Hs.2540 cholinergic receptor, nicotinic, alpha p 3.1
422083 NM_001141 Hs.111256 arachidonate 15-lipoxygenase, second typ 3.1
413507 BE145360 Hs.190054 ESTs, Weakly similar to I38022 hypotheti 3.1
415989 AI267700 Hs.317584 ESTs 3.1
422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1
425548 AA890023 Hs.1906 prolactin receptor 3.1
422599 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) 3.1
439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1
453883 AI638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1
458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protei 3.1

418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.1
400814 NA Target Exon 3.1
402327 Target Exon 3.1
416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1
439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1
437036 AI571514 Hs.133022 ESTs 3.1
449523 NM_000579 Hs.54443 chemokine (C-C motif) receptor 5 3.1
406642 AJ245210 gb:Homo sapiens mRNA for immunoglobulin 3.1
406624 AF052762 gb:Homo sapiens clone csneg8-1 immunoglo 3.1
421924 BE514514 Hs.109606 coronin, actin-binding protein, 1A 3.1
414523 AU076533 Hs.76353 serine (or cysteine) proteinase inhibito 3.1
416379 N38857 Hs.203933 ESTs 3.1
422823 D89974 Hs.121102 vanin 2 3.1
433904 At3g9956 Hs.208956 ESTs 3.1
421904 BE143533 Hs.109309 hypothetical protein FLJ20035 3.1
428834 AW899713 Hs.339315 ESTs 3.1
436043 AW983838 Hs.168830 Homo sapiens cDNA FLJ12136 fis, clone MA 3.1
452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3.1
405381 NA Target Exon 3.1
428746 AW503820 Hs.192861 Spi-B transcription factor (Spi-1/PU.1 r 3.1
435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.1
425782 U66468 Hs.159525 cell growth regulatory with EF-hand doma 3.1
423306 W88562 Hs.108198 ESTs 3.1
419123 AA234276 Hs.88253 ESTs 3.1
438581 AW977766 Hs.292133 ESTs, Moderately similar to I78885 serin 3.1

417105 X60992 Hs.81226 CD6 antigen 3.0
428361 NM_015905 Hs.183858 transcriptional intermediary factor 1 3.0
417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0
402606 NM_024626:Homo sapiens hypothetical prot 3.0
401451 NM_004496*:Homo sapiens hepatocyte nucle 3.0
421878 AA299652 Hs.111496 Homo sapiens cDNA FLJ11643 fis, clone HE 3.0
409518 BE384836 Hs.3454 KIAA1821 protein 3.0
416933 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept 3.0
414324 Y14768 Hs.890 lymphotoxin beta (TNF superfamily, membe 3.0
425081 X74794 Hs.154443 minichromosome maintenance deficient (S. 3.0
401519 C15000476*:gi|12737279|ref|XP_012163.1| 3.0
411704 AI499220 Hs.71573 hypothetical protein FLJ10074 3.0
428819 AL135623 Hs.193914 KIAA0575 gene product 3.0
428423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 3.0
413835 AI272727 Hs.249163 fatty acid hydroxylase 3.0
412600 L28824 Hs.74101 spleen tyrosine kinase 3.0
410491 AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 3.0
433658 L03678 Hs.156110 Immunoglobulin kappa constant 3.0
427666 AI791495 Hs.180142 calmodulin-like skin protein 3.0
452514 AI904898 gb«C-BT068-130399-085 BT068 Homo sapien 3.0
429500 X78565 Hs.289114 hexabrachion (tenascin C, cytotactin) 3.0
432485 N90866 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 3.0
437400 AB011542 Hs.5599 EGF-like-domain, multiple 5 3.0
452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 3.0



413269 BE167526 gb:CM4-HT0509-080300-107-g07 HT0509 Homo 3.0
453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 3.0
400929 ENSP00000252232*:Sterol regulatory eleme 3.0
445145 AI961702 Hs.147434 ESTs 3.0
432615 AA557191 Hs.55028 ESTs, Weakly similar to I54374 gene NP2 3.0
423279 AW959861 Hs.290943 ESTs 3.0
429392 AL109712 Hs.296505 Homo sapiens mRNA full length insert cDN 3.0
408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC_HUMAN !!!! 3.0
451346 NM_006338 Hs.26312 glioma amplified on chromosome 1 protein 3.0
413109 AW389845 Hs.110855 ESTs 3.0
401714 NA ENSP00000241802*:CDNA FLJ11007 FIS, CLON 3.0
421462 AF016495 Hs.104624 aquaporin 9 3.0
421750 AK000768 Hs.107872 hypothetical protein FLJ20761 3.0
453293 AA382267 Hs.10653 ESTs 3.0
457085 AA412446 Hs.98138 ESTs 3.0
438930 AW843633 Hs.306163 hypothetical protein AL110115 3.0

TABLE 22A



Table 22A shows the accession numbers for those pkeys lacking UniGene IDs in Table 22.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number

CAT number: Gene cluster number
Accession: Genbank accession numbers
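
The three fields defined above can be read from a row with a short parser. The following Python sketch is illustrative only and is not part of the original disclosure: the names `ClusterRow` and `parse_cluster_row` are hypothetical, and the sketch assumes each row has already been cleaned so that the Pkey, the CAT number, and every accession are separated by whitespace.

```python
from typing import NamedTuple


class ClusterRow(NamedTuple):
    """One Table 22A row (hypothetical container, not from the source)."""
    pkey: int               # unique Eos probeset identifier number
    cat_number: str         # gene cluster number, e.g. "859865_1"
    accessions: list[str]   # Genbank accession numbers in the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-delimited row into Pkey, CAT number, accessions."""
    pkey, cat_number, *accessions = line.split()
    return ClusterRow(int(pkey), cat_number, accessions)


# Example row for Pkey 451128, assuming whitespace-separated accessions.
row = parse_cluster_row("451128 859865_1 AL118668 D78823 AI762175")
print(row.pkey, row.cat_number, row.accessions)
```

Keeping the CAT number as a string preserves the underscore suffix, which a numeric type would lose.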



Pkey CAT number Accessions

407980 103087_1 AA046309 AI263500 AA046397
410785 1221055_1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
411743 1258098_1 AW862214 AW859811 AW8e2215
412138 1279172_1 AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538
413269 1356981_1 BE157526 BE167651 BE076401 R24654
416935 163179_1 AA190712 AA190665 AA252564
422128 211994_1 AW881145 AA490718 M85637 AA304575 T06067 AA331991
423945 233566_1 AA410943 AW948953 AA334202 AA332882
424109 23550S_1 AW406878 AW986560 AW966151 AW966496 AA336174 AA335376 AA335537
424128 235728_1 AW966163 AA335983 AA336011 AA335668 AA335973
425331 250199_1 AW962128 AA355353 AA427363
426878 273265_1 BE069341 AW748403 ALi)44891 AI908240 AA393080
432745 353673_1 AI821926 AA658826 AA564492 AA835129 AI791191
441153 51084_2 BE562826 BE378727
448212 755099_1 AI475858 AW969013
451128 859865_1 AL118668 D78823 AI762175
452514 920172_1 AI904898 AI904849 Aig04899
456207 165078_-1 AA193450
TABLE 22B



Table 22B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 22. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

400814 8569925 Minus 72840-72924,74761-74849
400929 7651921 Minus 122033-122241,123483-124028
401045 8117619 Plus 90044-90184,91111-91345
401451 6634068 Minus 119926-121272
401519 6649315 Plus 157315-157950
401645 7657839 Minus 34986-35133
401714 6715702 Plus 96484-96681
401866 8018106 Plus 73126-73623
402327 7656695 Minus 108675-108770,109801-109910
402359 9211204 Minus 40403-41961
402408 9796239 Minus 110326-110491
402470 9797107 Plus 195129-195776
402542 9801558 Minus 67076-67594
402578 9884928 Plus 66350-66498
402606 9909429 Minus 81747-82094
403011 6693597 Minus 3468^23
403212 7630897 Minus 156037-156210
403329 8516120 Plus 96450-96598
403366 8783692 Minus 49323-49652
403485 9966528 Plus 2886-3001,3198-3532,3655-4117
404347 9838195 Plus 74493-74829
404580 6539738 Minus 240588-241589
404755 7706327 Minus 53729-53846
405017 6532084 Plus 35551-35690
405348 2914717 Minus 43310-43462
405381 6006820 Minus 7636-8054
405801 2924321 Plus 63469-63694
405850 6164995 Plus 13871-14110
406153 9929734 Minus 12902-13069
406348 9255985 Minus 71754-71944
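
By way of illustration only, the exon positions listed for each pkey can be read programmatically. The following is a minimal sketch that splits a position entry such as 122033-122241,123483-124028 into (start, end) pairs and computes exon lengths. The function names `parse_exons` and `exon_lengths` are hypothetical, and the sketch assumes that both coordinates are inclusive, which the table itself does not state.

```python
def parse_exons(nt_position):
    """Split a position entry like '122033-122241,123483-124028'
    into a list of (start, end) integer pairs."""
    exons = []
    for part in nt_position.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in {part!r}")
        exons.append((start, end))
    return exons


def exon_lengths(exons):
    """Length of each exon, ASSUMING inclusive coordinates (an assumption)."""
    return [end - start + 1 for start, end in exons]


if __name__ == "__main__":
    exons = parse_exons("122033-122241,123483-124028")
    print(exons)
    print(exon_lengths(exons))
```

Entries that were damaged in reproduction (for example a range with no hyphen) would raise a `ValueError` here rather than being silently guessed.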
TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT BREAST

Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast.
These were selected as for Table 22, except that the numerator was set to the median value
for 12 non-malignant breast specimens, the denominator was set to the median value amongst
the 73 breast cancers, the 90th percentile value amongst the 12 non-malignant breast
specimens was greater than or equal to 80 units, and the ratio was greater than or equal to 4.0
(4-fold down-regulated in tumor vs. normal breast).
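
The selection rule stated above can be sketched in code as follows. This is an illustration under stated assumptions, not the procedure actually used: the function names are hypothetical, the percentile is computed by linear interpolation (the text does not say which method was used), probesets with a non-positive tumor median are simply rejected (also an assumption), and any further criteria inherited from Table 22 are not represented.

```python
from statistics import median


def percentile(values, q):
    """q-th percentile (0-100) by linear interpolation between ranks.
    The interpolation method is an assumption; the text does not specify one."""
    ordered = sorted(values)
    if len(ordered) == 1:
        return ordered[0]
    pos = (len(ordered) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def is_down_regulated(normal, tumor, min_units=80.0, min_ratio=4.0):
    """Apply the two stated thresholds to one probeset.

    normal: expression values for the non-malignant breast specimens
    tumor:  expression values for the breast cancer specimens
    """
    numerator = median(normal)
    denominator = median(tumor)
    if denominator <= 0:
        # Handling of non-positive medians is not described; reject (assumption).
        return False
    expressed_in_normal = percentile(normal, 90) >= min_units
    return expressed_in_normal and (numerator / denominator) >= min_ratio


if __name__ == "__main__":
    normal = [100.0] * 12   # 12 non-malignant specimens (made-up values)
    tumor = [20.0] * 73     # 73 breast cancers (made-up values)
    print(is_down_regulated(normal, tumor))
```

The ratio returned by `median(normal) / median(tumor)` corresponds to the "Ratio" column of the table under these assumptions.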



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of SO"* percentile nomial body tissue to 75"^ percentile tumor 



Pkey ExAccn UnigeneID UnigeneTitle Ratio

428722 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 22.4
428848 NM_000230 Hs.194236 leptin (murine obesity homolog) 17.4
445263 H57646 Hs.42586 KIAA1560 protein 15.4
418335 T28499 Hs.894d5 carbonic anhydrase IV 15.0
407228 M25079 Hs.155375 hemoglobin, beta 14.6
417511 AL049176 Hs.82223 chordin-like 14.6
439285 AL133916 Hs.172572 hypothetical protein FLJ20093 14.3
412442 AI983730 Hs.26530 serum deprivation response (phosphatidyl 13.6
410544 AI446543 Hs.95511 ESTs 12.6
412047 AA934589 Hs.49596 ESTs 12.2
422667 H25&42 Hs.133471 ESTs 12.0
406664 L34041 Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so 12.0
423201 NM_000163 Hs.125180 growth hormone receptor 11.7
422163 AF027208 Hs.112360 prominin (mouse)-like 1 10.8
428769 AW207175 Hs.106771 ESTs 10.6
407049 X72632 NM_021724:Homo sapiens nuclear receptor 10.1
412295 AW088828 Hs.117176 poly(A)-binding protein, nuclear 1 9.8
425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8
406791 AI220684 Hs.272572 hemoglobin, alpha 2 9.5
447471 AF039843 Hs.18676 sprouty (Drosophila) homolog 2 9.5
451533 NM_004557 Hs.26530 serum deprivation response (phosphatidyl 9.4
419407 AW410377 Hs.41502 hypothetical protein FLJ21276 9.0
411939 AI365585 Hs.146246 ESTs 9.0
410532 T53088 Hs.155376 hemoglobin, beta 8.9
425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 8.8
416585 X54162 Hs.79386 leiomodin 1 (smooth muscle) 8.7
443060 D78674 Hs.d944 procollagen C-endopeptidase enhancer 2 8.6
435265 AA779958 Hs.185932 ESTs 8.5
422511 AU076442 Hs.117938 collagen, type XVII, alpha 1 8.4
433138 AB029496 Hs.59729 semaphorin sem2 8.3
402195 Nl\/L00449r'>1onm) sapiens hepatocyte nude 8.1
429350 AI754634 Hs.131987 ESTs 8.1
445107 AI208121 Hs.147313 ESTs, Weakly similar to 138022 hypotheti 8.1
406643 N77976 Hs.272572 hemoglobin, alpha 2 8.0
410199 AW377424 Hs.205126 Homo sapiens cDNA: FLJ22667 fis, clone H 8.0
417225 AA815048 Hs.24078 hypothetical protein FLJ12649 7.8
437569 AA760849 Hs.294052 ESTs 7.5
436062 AK000027 Hs.98633 ESTs 7.5
425078 NM_002599 Hs.154437 phosphodiesterase 2A, cGMP-stimulated 7.5
430327 AW973636 Hs.55931 ESTs 7.4
447577 AI393693 Hs.183297 DKFZP566F2124 protein 7.4
446039 AI150491 Hs.90756 ESTs 7.2
422060 R20893 Hs.325823 ESTs, Moderately similar to ALUS_HUMAN A 7.2
424455 M452005 Hs.333199 ESTs 7.1
424343 AW956360 Hs.4748 adenylate cyclase activating polypeptide 7.1
442792 AI352340 Hs.131194 ESTs 7.0
406714 AI219304 Hs.283108 hemoglobin, gamma G 6.9
407571 A(446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY 6.8
429580 AA346839 Hs.209100 DKFZP434C171 protein 6.7
453500 AI478427 Hs.43125 esophageal cancer related gene 4 protein 6.7
422233 AB002058 Hs.113275 purinergic receptor P2X-like 1, orphan r 6.7
420205 AA256395 Hs.88156 ESTs 6.6
404368 NA ENSP00000241075*:TRRAP PROTEIN. 6.6
447261 NM_006691 Hs.17917 extracellular link domain-containing 1 6.5
417090 AA193282 Hs.85863 ESTs, Weakly similar to B34612 zinc fing 6.5
410677 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 6.5
427283 AL119795 Hs.174185 ectonucleotide pyrophosphatase/phosphodi 6.5
415011 AW963085 gb:EST375158 MAGE resequences, MAGH Homo 6.4
412068 S72043 Hs.73133 metallothionein 3 (growth inhibitory fac 6.4
416253 BE25G659 Hs.15463 Homo sapiens, clone IMAGE:2959994, mRNA 6.4
435885 AA701483 Hs.36341 ESTs 6.3
402779 NA Target Exon 6.3
418138 AA213626 Hs.136204 EST 6.3
439335 AA742697 Hs.62492 ESTs, Weakly similar to B39066 proline-r 6.3
427019 AA001732 Hs.173233 hypothetical protein FLJ10970 6.2
411478 BE143068 gb:MR&l1T0158-030200mb09 HT0158 Homo 6.2
452654 BE004783 gb:MR2^0114-270400O04-e11 BN0114 Homo 6.1
447359 NM_012093 Hs.18268 adenylate kinase 5 6.1
414323 NM_014759 Hs.334688 KIAA0273 gene product 6.1
441266 H15968 Hs.293845 Homo sapiens, clone IMAGE:3502329, mRNA, 6.1
417011 F08212 Hs.234898 ESTs, Weakly similar to 2109260A B cell 6.0
400089 NA Eos Control 6.0
433614 W07475 Hs.277101 cytochrome c oxidase subunit IV isoform 5.9
440439 N92818 Hs.64754 ESTs, Weakly similar to potential CDS [H 5.9
454404 BE067414 gb:MR4-BT0355-200100-201-e05 BT0355 Homo 5.9
436704 AA062610 Hs.148056 EST 5.9
406563 NA Target Exon 5.9
433490 AW451023 Hs.65848 hypothetical protein DKFZp7610132 5.9
419313 AA843387 Hs.87279 ESTs 5.9
409196 NM_001874 Hs.334873 carboxypeptidase M 5.8
410882 AW809163 gb:MR4'ST0118-2610g9412-a03 ST0118 Homo 5.8
453469 AB014533 Hs.33010 KIAA0633 protein 5.8
441699 AI372588 Hs.8022 TU3A protein 5.8
426210 AA372052 Hs.334559 Homo sapiens cDNA FLJ14458 fis, clone HE 5.8
413065 BE063555 gb:GM1-BT0283^1199^3-dO9 BT0263 Homo 5.8
454192 AW876813 Hs.3343 phosphoglycerate dehydrogenase 5.7
425187 AW014486 Hs.22509 ESTs 5.7
429757 AW452355 Hs.256037 ESTs 5.7
420202 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5.7
416284 AI695473 Hs.298005 ESTs 5.7
428553 AA181641 Hs.184907 G protein-coupled receptor 1 5.6
404689 NA Target Exon 5.6
438887 R68857 Hs.265499 ESTs 5.6
406082 S47833 Hs.82927 adenosine monophosphate deaminase 2 (iso 5.6
449748 H23963 Hs.32043 ESTs 5.6
431048 R50253 Hs.249129 cell death-inducing DFFA-like effector a 5.5
452205 C15819 gb:C15819 Clontech human aorta polyA mRN 5.5
434040 AW444613 Hs.288809 hypothetical protein FLJ20159 5.5
407744 AB020629 Hs.38095 ATP-binding cassette, sub-family A (ABC1 5.5
450606 AI668605 Hs.60380 ESTs, Moderately similar to ALU6_HUMAN A 5.5
414629 AA345824 Hs.76688 carboxylesterase 1 (monocyte/macrophage 5.5
401665 C11000703:gi|10048448|ref|NP_065258.1|g 5.5
436107 T99079 Hs.191194 ESTs 5.5
444432 AI161428 Hs.75916 splicing factor 3b, subunit 2, 145kD 5.5
434715 BE005346 Hs.116410 ESTs 5.5
447205 BE617015 Hs.11006 ESTs, Moderately similar to T17372 pbsm 5.5
408122 AI432652 Hs.42824 hypothetical protein FLJ1071d 5.5
454016 AW016806 Hs.233108 ESTs 5.5
414913 R25621 gb7M5f06.r1 Soares placenta Nb2HP Homo 5.4
459033 AA017590 Hs.129907 ESTs 5.4
441003 BE172240 Hs.126379 ESTs, Weakly similar to 138022 hypotheti 5.4
450637 N49826 Hs.18602 ESTs 5.4
442398 AA994520 gbx)u42g09.s1 Soares_NFL_T_GBC_S1 Homo s 5.4
403612 NA Target Exon 5.3
407102 AA007629 Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so 5.3
410057 R66634 Hs.268107 multimerin 5.3
428232 BE272452 Hs.183109 monoamine oxidase A 5.3
432769 AA620814 Hs.144959 ESTs 5.3
431344 R99530 Hs.272572 hemoglobin, alpha 2 5.3
427032 AF012023 Hs.173274 integrin cytoplasmic domain-associated p 5.3
406305 BE261320 Hs.158196 transcriptional adaptor 3 (ADA3, yeast h 5.3
437411 AW613948 Hs.194915 ESTs 5.3
442800 AI809481 Hs.131227 ESTs 5.3
402054 NA Target Exon 5.3
432085 AF212829 Hs.272406 potassium channel, subfamily K, member 9 5.3
415313 R59638 Hs.6181 ESTs 5.2
459159 AI904646 gb:QV-BT065«020399-103 BT065 Homo sapien 5.2
427164 AB037721 Hs.173871 KIAA1300 protein 5.2
441391 BE467930 Hs.170381 ESTs 5.2
458959 AI285901 Hs.181297 ESTs 5.2
402698 NA ENSP00000251335*:DJ1003J2.1 (sodium and 5.2
401810 NA Target Exon 5.2
438879 AA827674 Hs.189073 ESTs 5.2
414657 AA424074 Hs.76780 protein phosphatase 1, regulatory (inhib 5.2
427809 M26380 Hs.180878 lipoprotein lipase 5.1
456063 NM_006744 Hs.76461 retinol-binding protein 4, interstitial 5.1
451186 AW023469 Hs.65256 ESTs, Weakly similar to leucine-rich gli 5.1
451882 AI821324 Hs.100445 ESTs 5.1
402583 NA NM_021620:Homo sapiens PR domain contain 5.1
431130 NM_006103 Hs.2719 HE4; WFDC2; putative ovarian carcinoma m 5.1
458218 AI435179 Hs.126820 ESTs 5.1
416083 R53467 Hs.269122 ESTs, Weakly similar to ALU1_HUMAN ALU S 5.1
455282 BE143867 gb:(V!R0-Kr0164O70100-013-h02 HT0164 Homo 5.1
426488 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 5.1
426156 BE244537 Hs.167382 natriuretic peptide receptor A/guanylate 5.1
407891 AA486620 Hs.41135 endomucin-2 5.0
408610 AW026692 Hs.224829 ESTs 5.0
445967 D59597 Hs.118821 CGI-62 protein 5.0
434813 AI524307 Hs.162870 ESTs 5.0
437526 AI076012 Hs.121388 ESTs, Weakly similar to MDHC_HUMAN MALAT 5.0
454775 BE160229 gb:QV1-HT0413^0200-062^12 HTO413 Homo 5.0
409451 AF012626 Hs.54472 fragile X mental retardation 2 5.0
409853 AW502327 gb:Ui-HF-BR0r«ka.a^-0-Ul.r1 NIHJVIGC^5 5.0
405062 Target Exon 5.0
446490 AK000706 Hs.15125 hypothetical protein FLJ20699 5.0
417622 AW298163 Hs.82318 WAS protein family, member 3 5.0
421978 AJ243662 Hs.110196 NICE-1 protein 5.0
440338 R62431 Hs.12758 ESTs 5.0
415421 R35009 Hs.24903 ESTs 5.0
417574 R00348 gb:ye69e06.r1 Soares fetal liver spleen 5.0
409882 AJ243191 Hs.56874 heat shock 27kD protein family, member 7 5.0
447998 AI7682B9 Hs.304389 ESTs 4.9
445613 BE550889 Hs.158491 ESTs 4.9
443074 AW341470 Hs.144907 ESTs 4.9
451324 AI783600 Hs.206052 ESTs 4.9
432433 AW014734 Hs.157969 ESTs 4.9
449654 Aig6g812 Hs.1998S0 ESTs 4.9
414519 N94587 Hs.55063 ESTs 4.9
457531 AW973716 Hs.13913 KIAA1577 protein 4.9
433200 AA682722 Hs.192725 ESTs 4.9
430782 AF026263 Hs.247920 cholinergic receptor, muscarinic 5 4.8
427555 AW137094 Hs.97990 ESTs 4.8
433545 AA868510 Hs.112496 ESTs 4.8
420334 AI34S351 Hs.118944 hypothetical protein FLJ22477 4.6







AO0U94 


Ue 001000 

nS.2ooo22 


/1 97100 
4// loo 




ns,17o7u 




□bUoo9oo 






AW450451 


Un occocc 
nS,2D03o5 


40O053 


AW1 39474 


Hs^46oo2 


427067 


AAo4371o 


Hs. 177927 


442969 


A inn c Ann 

AI025499 


lis. 132238 


426220 


AI383475 


11-, A^ACf\1 

Hs,171697 


414o9o 


Dc36o7o4 




426893 


AA398716 


Hs.97418 


434046 


AiAMnn0ii s 

AW292616 


Hs.1 13011 


401590 


NA 




457971 


AXktA^ A^tn 

AW1 34679 


ij_ nAno An 

Hs542849 


427722 


AKD00123 


Hs,180479 


443793 


AA045290 


Hs,25930 


407737 


R49187 


Hs.6659 


441955 


AA972327 


Hs.142903 


441499 


AW298235 


Ua AnAcon 

Hs. 101 689 


447517 


AI382726 


1 1 A t\n t^A 

Hs. 182434 


403017 






450580 


N40087 


Hs. 15248 


404611 


H58589 


Hs.35156 


414831 


M31158 


Hs.77439 


459290 


N^yX)1546 


Hs34d53 


444341 


AI142027 


ns.146650 


408614 


AL137698 


Hs.46531 


449638 


AW204277 


H&29U7Z3 


434418 


A PA f% A^fV9 

AF1 34707 


Hs.278679 


447360 


AI375984 


Hs.167216 


419583 


F00312 




A Af\ef\a 

440698 


AI348455 


u_ A Al Ann 

Hs. 147492 


451199 


AI290653 


t 1 AAA 

Hs.124758 


438338 


klA A ^A A t\^A 

NwL014861 


Hs.6168 


433756 


AW015933 


1 1^ A A A^PA 

Hs.1 12654 


423301 


S67580 


Hs.1645 


417237 


H86385 


Hs.81737 


439745 


AL389981 


1 1— A A AAA A 

Hs.149219 


424137 


AA335769 


k 1^ A AA AA 

Hs,16262 


449338 


H73444 


Hs.394 


434744 


N94835 


Hs^3826 


407402 


AP035303 




443510 


L 1 k A AA 

N^1.012190 


HsS520 


415754 


A A dl nf\A A A 

AA169114 


Hs. 12247 


415986 


Z43619 




457416 


BE1 42052 


Hs.62654 


418064 


BE387287 


1 1^ AAA A A 

Hs,83384 


437120 


AI355125 


ns.157767 


453950 


A A Acf*nn<\ 

AA1 55998 


H8.211560 


401093 






436935 


hxkinnf^ An A 

AW20d494 


Hs.253560 


457974 


AW842353 


LJ.. OOW-T4T 

Hs.321717 






nS.IOOUtx) 


442705 


AI264634 


Hs.131127 


437409 


AL35g599 


Hs^850 


458494 


A1380906 


Hs.158436 


410490 


H03589 




416069 


R37101 


Hs^82 


438463 


AA807958 


Hs.314232 


444043 


AI499723 


Hs.135089 


438327 


H87407 


Hs.172944 


457711 


AF147401 


Hs.23917 


400870 






441425 


AA933590 


Hs.28937 


416267 


H45384 




407262 


M12873 




444567 


AV654020 


Hs.184261 


403263 


NA 





Rhesus blood group, D antigen 4.8
phosphatidic acid phosphatase type 2B 4.8
gb:QV3>BT0296-14020aO85^01 BTQ296 Homo 4.8
ESTs 4.8
ESTs 4.8
ESTs 4.7
ESTs 4.7
ESTs, Weakly similar to T13924 sdk prote 4.7
gb:601273249F1 NIH_MGC_20 Homo sapiens ( 4.7
ESTs 4.7
ESTs 4.7
Target Exon 4.7
ESTs 4.7
hypothetical protein FLJ20116 4.6
ESTs, Weakly similar to 2109260A B cell 4.6
ESTs 4.6
ESTs 4.6
ESTs 4.6
ESTs 4.6
Target Exon 4.6
ESTs 4.6
Homo sapiens cDNA FLJ11027 fis, clone PL 4.6
protein kinase, cAMP-dependent, regulato 4.6
inhibitor of DNA binding 4, dominant neg 4.6
ESTs 4.6
Homo sapiens mRNA; cDNA DKFZp434C1915 (f 4.6
hypothetical protein MGC2747 4.6
a disintegrin and metalloproteinase doma 4.6
ESTs 4.6
gb:HSBB0D101 STRATAGENE Human skeletal m 4.6
Homo sapiens cDNA FLJ11777 fis, clone HE 4.6
ESTs 4.6
KIAA0703 gene product 4.6
Homo sapiens, clone MGC:9764, mRNA, comp 4.5
cytochrome P450, subfamily IVA, polypept 4.5
palmitoyl-protein thioesterase 2 4.5
Homo sapiens mRNA full length insert cDN 4.5
ESTs 4.5
adrenomedullin 4.5
Homo sapiens genomic DNA, chromosome 21q 4.5
gb:Homo sapiens clone 23943 mRNA sequenc 4.5
formyltetrahydrofolate dehydrogenase 4.5
hypothetical protein FLJ11413 4.5
gb:HSC1GE121 normalized infant brain cDN 4.5
kringle-containing transmembrane protein 4.5
S100 calcium-binding protein, beta (neur 4.4
ESTs, Weakly similar to HXA2_HUMAN HOMEO 4.4
eukaryotic translation initiation factor 4.4
C12000585*:gi|6330167|dbj|BAA86477.1|(A 4.4
ESTs 4.4
ESTs, Weakly similar to S22765 heterogen 4.4
Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4
ESTs 4.4
Homo sapiens mRNA; cDNA DKFZp547C126 (fr 4.4
ESTs 4.4
gb:yj42d08.r1 Soares placenta Nb2HP Homo 4.4
ESTs 4.4
ESTs 4.4
ESTs 4.4
chorionic gonadotropin, beta polypeptide 4.4
ESTs 4.3
C11000905:gi|11692565|gb|AAG39879.1|AF28 4.3
homeobox protein from AL590526 4.3
gb:yn99c10.r1 Soares adult brain N2b5HB5 4.3
gb:Human Ig rearranged H-chain mRNA Vt)J4 4.3
ESTs, Weakly similar to T26686 hypotheti 4.3
Target Exon 4.3
410034 BE067414 gb:MR4-BT0355-200100-201-e05 BT0355 Homo 4.3
456804 AI421645 Hs.139851 caveolin 2 4.3
446427 BE395260 Hs.309438 EST 4.3
416931 D45371 Hs.80485 adipose most abundant gene transcript 1 4.3
421296 NM_002666 Hs.103253 perilipin 4.3
400973 NA ENSP00000236667*:Mucin 5B (Fragment). 4.3
452602 AW365194 Hs.55962 ESTs 4.3
412330 NM_005100 Hs.788 A kinase (PRKA) anchor protein (gravin) 4.3
405016 CY00017r:gi|9280405|gb|AAF86402.1|AF245 4.3
435104 AI475671 Hs.88607 ESTs, Highly similar to F-box protein FB 4.3
408118 ENSP00000246632:CDNA FLJ20261 fis, clone 4.3
418558 T02850 gb:FB12A9 Fetal brain, Stratagene Homo s 4.3
429745 AA480818 Hs.221736 ESTs 4.3
433088 AW451206 Hs.115699 ESTs 4.3
444445 AA342329 Hs.115920 Homo sapiens cDNA: FLJ22816 fis, clone K 4.3
453880 AI803166 Hs.28462 ESTs, Weakly similar to 138022 hypotheti 4.3
447384 AI377221 Hs.40528 ESTs 4.2
414541 BE293116 Hs.76392 aldehyde dehydrogenase 1 family, member 4.2
444975 AV652165 Hs.182482 ESTs, Weakly similar to T00362 hypotheti 4.2
403921 NA C5000212*:gi|10047237|dbj|BAB13407.1| (A 4.2
451477 AI798425 Hs.42710 ESTs 4.2
406344 C5001660*:gi|11611537|dbj|BAB18935.1|(AB 4.2
416970 AA191201 Hs.35861 DKFZP585E1621 protein 4.2
413662 BE155856 Hs.25522 KIAA1808 protein 4.2
458504 AW070634 Hs.144794 ESTs 4.2
404682 NA C9001188*:gi|12738842|refP^PJ73725.1| p 4.2
418089 N69d13 Hs.6858 ESTs, Weakly similar to 178885 serine/th 4.2
403433 NM_001622:Homo sapiens alpha-2-HS-glycop 4.2
446532 AW975460 Hs.143563 ESTs 4.2
414217 AI309298 Hs.27g896 Homo sapiens cDNA: FLJ23165 fis, clone L 4.2
418425 AI871247 Hs.6262 hypothetical protein MGC8407 4.2
419589 AW973708 Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 4.2
457029 AA397789 Hs.161803 ESTs 4.2
447860 AF193807 Hs.131835 Rhesus blood group, B glycoprotein 4.2
448988 Y09763 Hs.22785 gamma-aminobutyric acid (GABA) A recepto 4.2
440610 AI733098 Hs.130800 ESTs 4.2
439590 AF086410 gb:Homo sapiens full length insert cDNA 4.2
427240 AA399975 Hs.274151 ligatin 4.2
408932 AW594172 Hs.278513 TP53TG3 protein 4.2
436112 T77545 Hs.187559 ESTs 4.2
444382 AI144152 Hs.58246 ESTs 4.2
456716 AA318060 Hs.135121 hypothetical protein FLJ22415 4.2
419846 NM_015977 Hs.285681 Williams-Beuren syndrome chromosome regi 4.2
410036 R57171 Hs.57975 calsequestrin 2 (cardiac muscle) 4.1
400545 NA Target Exon 4.1
403051 NA Target Exon 4.1
420139 NM_005357 Hs.95351 lipase, hormone-sensitive 4.1
450244 AA007534 Hs.125062 ESTs 4.1
453261 AA034116 Hs.118494 ESTs 4.1
440246 W52010 Hs.191379 ESTs 4.1
414516 AI307802 Hs.135560 ESTs, Weakly similar to T43458 hypotheti 4.1
438232 AI150595 Hs.122226 ESTs 4.1
410233 AA082947 gb:zn10g07.s1 Stratagene hNT neuron (937 4.1
412179 BE270758 Hs.59428 hypothetical protein MGC3020 4.1
441871 AI306150 Hs.153450 ESTs, Weakly similar to 1909123A Na glue 4.1
426411 AK000708 Hs.169764 hypothetical protein FLJ20701 4.1
453692 AL110418 gb:DKFZp434K0431_r1 434 (synonym: htes3) 4.1
446640 AW817177 Hs.102558 Homo sapiens, clone MGC:5352, mRNA, comp 4.1
417481 AA203281 Hs.21798 ESTs 4.1
412912 AW118878 Hs.110835 ESTs 4.1
454183 AW807116 gb:MR4-ST0062-040100^4-b12 510062 Homo 4.1
426328 AW631296 gb:hh83o09.y1 NCI_CGAP_GU1 Homo sapiens 4.1
435942 R06285 Hs.191215 ESTs 4.1
417629 T76945 gb:yc92c07.r1 Soares infant brain 1NIB H 4.1
403593 NA Target Exon 4.0
402690 Target Exon 4.0
418190 R49591 Hs.270425 ESTs 4.0
408641 AW245207 Hs.5555 hypothetical protein MGC5347 4.0
427899 AA829286 Hs.332053 serum amyloid A1 4.0
445975 AI811536 Hs.145734 ESTs 4.0
438831 BE263273 Hs.6439 synapsin II 4.0
455578 BE006350 Hs.14355 Homo sapiens cDNA FLJ13207 fis, clone NT 4.0
401840 NA Target Exon 4.0
413753 U17760 Hs.75517 laminin, beta 3 (nicein (125kD), kalinin 4.0
445030 AI205925 Hs.147238 ESTs, Highly similar to AAC3_HUMAN ALPHA 4.0
433873 AW156913 Hs.150478 ESTs, Weakly similar to A Chain A, Cryst 4.0
456736 AW248217 Hs.1619 achaete-scute complex (Drosophila) homol 4.0
450112 BE047734 Hs.5473 ESTs, Moderately similar to ALUS_HUMAN A 4.0
448906 AI589S67 Hs.309719 ESTs 4.0
TABLE 23A

Table 23A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 23.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset Identifier number 

CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



409853 1156226_1 AW502327 AW5024a8 AW50182g AW502625 AW5Q2687
410034 1170594_1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412
410233 118656_1 AA082947 AA083036
410490 1205347_1 H03589 AW750687 AW750688
410882 1225686_1 AW809163 AW809247 AW809177 AW809190 AW809225
411478 1247073_1 BE143068 AW849143 AW848705 AWW8569 AW848071 AW848475 AW848092 AW848005
413065 1347960_1 BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322
413072 1348163_1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE064033 BE063884
414593 1464909_1 BE386764 BE387560
414913 1506721_1 R25621 C03959 C04010
415011 151328_1 AW963085 AA159005 AW963073
415986 1564410_1 Z43619 R61274 H12206 R12883
416267 1583547_1 H45384 H49125 H41699
417574 1687770_1 R00348 R09593
417629 1690392_1 T76945 R20210 R05755
418556 1767866_-1 T02850
419583 186198_1 F00312 AA247490 F31427 AA383663 F22D45
426328 264901_1 AW631296 AA375484
439590 47413_1 AF086410 W94386 W74609
442398 54127T_1 AA994520 AW393574
452205 90415_1 C15819 AA024741 AA024742
452654 925931_1 BE004783 BE004947 AI911790
453692 977825_1 AL110418 AW876759

454183 1049636 1 AW8071 16 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
BE141 569 AW807401 AW807310 BE141565 AW807318 AW8071 19 AW807299 AW807241 AW807225 AW807204 AW807345 
AW807103 BE14161 5 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AW807220 AW807399 
AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW8071 12 AW807378 AW807202 BE141593 AW807216 
AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
AW8071 10 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
AW807124 AW807419 AW807O31 AW807264 AW8a7032 AW807029 AW807052 AW807391 AW807207 AW807215AW807019 
AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141586 AW845877 AW807418 AW807407 
AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW807311 AW807214 AW807132 AW807402 AW807350 
AW807028 AW807298 AW807291 AW807305 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW8071 98 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
AW800079AW807066AW807077 AW845923AW807421 BE141619 BE1 40943 AW807420AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 

454404 1170594_1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412
454775 1234106_1 BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE152911
455282 1273020_1 BE143867 AW935060 AW886684
459159 919998_1 AI904646 BE179494 BE179421
TABLE 23B



Table 23B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 23. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.

Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

400545 9800107 Minus 124618-124881
400870 9838306 Minus 34081-35027
400973 7960452 Minus 98119-98253
401093 8516137 Minus 22335-23166
401590 9966320 Minus 33547-33649
401665 7145001 Plus 121591-122537
401810 7342191 Plus 129063-129476
401840 7684597 Plus 56283-56439
402054 8083691 Minus 8288^06
402195 7689778 Minus 147901-148884
402583 7684486 Plus 94883-95003
402690 8346058 Plus 13368-13998
402698 8570304 Minus 108641-108903
402779 9588555 Minus 38173-39210
403017 6693623 Plus 78630-79367
403051 4827080 Minus 5269-5411
403263 7770677 Plus 52431-52737
403433 9719611 Minus 72225-72437
403593 6862650 Minus 62554-62712,69449-69602
403612 8469060 Minus 94723-94859
403921 7711590 Minus 3297-3536
404368 7630956 Minus 102053-102199
404682 9797231 Minus 40977-41150
404689 7534100 Plus 119461-119717
405016 6524300 Plus 51997-53308
405062 7657730 Plus 101283-101432
406118 9143818 Plus 53997-54629
406344 9255974 Plus 20254-20374,20526-20669,20835-21097
406563 7711604 Plus 34401-34538
TABLE 24:

Table 24 depicts Seq ID No., UnigeneID, UnigeneTitle, Pkey, Pred.Cell.Loc., and ExAccn
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to
Table 25.



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number

UnigeneTitle: Unigene gene title

Pred.Cell.Loc.: Predicted Cellular Localization

Seq.ID.No.: Sequence Identification Number found in Table 25

Pkey ExAccn UnigeneID UnigeneTitle Pred.Cell.Loc. Seq.ID.No.



449746 
407276 
415539 
400297 
450375 
102457 
429170 
424399 
422505 
449765 
425692 
426215 
439840 
410102 
429220 
416276 
409079 
442818 
442082 
444381 
446163 
416636 
442117 
433043 
429353 
452190 
446733 
452747 
423242 
417433 
432201 
423961 
439569 
114480 
404561 
325372 
112287 
335824 
424735 
400289 
427585 
429925 
429441 
421155 
420931 
420813 
452744 



AI668594 Hs.176588
AI951118 Hs.326736
Af733881 Hs.72472
AI127076 Hs.334473
AA009647 Hs.8850
NM_001394 Hs.2359
NM_001394 Hs.2359
AI905687 Hs.2533
AL120862 Hs.124165
N92293 Hs.206832
D90041 Hs.155956
AW963419 Hs.155223
AW449211 Hs.105445
AW248508 Hs.279727
AW207206 Hs.136319
U41060 Hs.79136
W87707 Hs.82065
AK001741 Hs.8739
R41823 Hs.7413
BE387335 Hs.283713
AA026880 Hs.25252
N32S36 Hs.42645
AW864964 Hs.128899
W57554 Hs.125019
AL117406 Hs.200102
H26735 Hs.91668
AA863360 Hs.26040
BE153855 Hs.61460
AL039402 Hs.125783
BE270266 Hs.82128
AI538613 Hs.298241
D13666 Hs.136348
AW602166 Hs^22399
BH)66778 Hs.151678

NA 

AB033064 Hs.334806 
NA 

U31875 Hs.272499 

X07820 Hs.2258 

D31152 Hs.179729 
NM_000786 Hs.226213

AJ224172 Hs.204096

H87879 Hs.102267

AF044197 Hs.100431

X51501 Hs.99949

AI267652 Hs.30504



ESTs, Weakly similar to CP4Y_HUMAN CYTOC
Homo sapiens breast cancer antigen NY-BR
BMP-R1B
hypothetical protein DKFZp56401278
a disintegrin and metalloproteinase doma
dual specificity phosphatase 4 nuclear
dual specificity phosphatase 4 nuclear
aldehyde dehydrogenase 9 family, member cytoplasm
ESTs
ESTs, Moderately similar to ALU8_HUMAN A
N-acetyltransferase 1 (arylamine N-acety
stanniocalcin 2
GDNF family receptor alpha 1
Homo sapiens cDNA FLJ14035 fis, clone HE
ESTs
LIV-1 protein, estrogen regulated
interleukin 6 signal transducer (gp130,
hypothetical protein FLJ10879
ESTs
ESTs, Weakly similar to S64054 hypotheti
Homo sapiens cDNA FLJ13603 fis, clone PL
solute carrier family 16 (monocarboxylic
ESTs
lymphoid nuclear protein (LAF-4) mRNA
ATP-binding cassette transporter MRP8
Homo sapiens clone PP1498 unknown mRNA
ESTs, Weakly similar to fatty acid omega
Ig superfamily receptor LNIR
DEME-6 protein
5T4 oncofetal trophoblast glycoprotein
Transmembrane protease, serine 3
osteoblast specific factor 2 (fasciclin
CEGP1 protein
UDP-N-acetyl-alpha-D-galactosamine:polyp
NM_014112*:Homo sapiens trichorhinophala mitochondria
Phase2&3Exons nuclear
KIAA1238 protein
ENSP00000249072*:DJ222E13.1 (N-TERMINAL
short-chain alcohol dehydrogenase family
matrix metalloproteinase 10 (stromelysin
collagen, type X, alpha 1 (Schmid metaph
cytochrome P450, 51 (lanosterol 14-alpha ER
lipophilin B (uteroglobin family member)
lysyl oxidase extracellular
small inducible cytokine B subfamily (Cy
prolactin-induced protein nuclear
Homo sapiens mRNA; cDNA DKFZp434E082 (fr
Seq ID 1 & 2
Seq ID 3 & 4
Seq ID 5 & 6
Seq ID 7 & 8
Seq ID 9 & 10
Seq ID 11 & 12
Seq ID 11 & 12
Seq ID 13 & 14
Seq ID 15 & 16
Seq ID 17 & 18
Seq ID 19 & 20
Seq ID 21 & 22
Seq ID 23 & 24
Seq ID 25 & 26
Seq ID 27 & 28
Seq ID 29 & 30
Seq ID 31 & 32
Seq ID 33 & 34
Seq ID 35 & 36
Seq ID 37 & 38
Seq ID 39 & 40
Seq ID 41 & 42
Seq ID 43 & 44
Seq ID 45 & 46
Seq ID 47 & 48
Seq ID 49 & 50
Seq ID 51 & 52
Seq ID 53 & 54
Seq ID 55 & 56
Seq ID 57 & 58
Seq ID 59 & 60
Seq ID 61 & 62
Seq ID 63 & 64
Seq ID 65 & 66
Seq ID 67 & 68
Seq ID 69 & 70
Seq ID 71 & 72
Seq ID 73 & 74
Seq ID 75 & 76
Seq ID 77 & 78
Seq ID 79 & 80
Seq ID 81 & 82
Seq ID 83 & 84
Seq ID 85 & 86
Seq ID 87 & 88
Seq ID 89 & 90
Seq ID 91 & 92
420757
424905 
429859 
446921 
445537 
428227 
424001 
421727 
452838 
419667 
414812 
426320 
422867 
411558 
417866 
428398 
431958 
428722 
412970 
421379 
415752 
444051 
451110 



X78592 Hs.99915 
NM_002497 Hs.153704 
NM_007050 Hs.225952 

AB012113 Hs.16530 

AJ245671 Hs.12844 

AA321649 Hs.2248 

W67883 Hs.137476 

Y13153 Hs.107318 

U55011 Hs.30743 

AU077005 Hs.92208 

X72755 Hs.77367 

W47595 Hs.169300 

L32137 Hs.1584 

AA102670 Hs.70725 

AW067903 Hs.82772 

AI249368 Hs.98558 

X63629 Hs.2877 

U76456 Hs.190787 

AB026436 Hs.177534 

Y15221 Hs.103982 

BE314524 Hs.78776 

N48373 Hs.10247 

AI955040 Hs.265398 



androgen receptor (dihydrotestosterone r 
NIMA (never in mitosis gene a)-related k 
protein tyrosine phosphatase, receptor t 
small inducible cytokine subfamily A (Cy 
EGF-like-domain, multiple 6 
small inducible cytokine subfamily B (Cy 
paternally expressed 10 
kynurenine 3-monooxygenase (kynurenine 3 
preferentially expressed antigen in mela 
a disintegrin and metalloproteinase doma 
monokine induced by gamma interferon 
transforming growth factor, beta 2 
cartilage oligomeric matrix protein (pse 
gamma-aminobutyric acid (GABA) A recepto 
collagen, type XI, alpha 1 
ESTs 

cadherin 3, type 1, P-cadherin (placenta 
tissue inhibitor of metalloproteinase 4 
dual specificity phosphatase 10 
small inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformation-r 



cytoplasm 
nuclear 

extracellular 

extracellular 



nuclear 

extracellular 
extracellular 
extracellular 



Seq 
Seq 
Seq 



Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 



plasma membrane 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 



extracellular 



ID 93 & 94 
ID 95 & 96 
ID 97 & 98 
ID 99 & 100 
ID 101 & 102 
ID 103 & 104 
ID 105 & 106 
ID 107 & 108 
ID 109 & 110 
ID 111 & 112 
ID 113 & 114 
ID 115 & 116 
ID 117 & 118 
ID 119 & 120 
ID 121 & 122 
ID 123 & 124 
ID 125 & 126 
ID 127 & 128 
ID 129 & 130 
ID 131 & 132 
ID 133 & 134 
ID 135 & 136 
ID 137 & 138 
TABLE 24A 

Table 24A shows the accession numbers for those pkeys lacking UniGene IDs for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

335824 CH22.3197FG,619JUIN1CE 
325372 Gl2_hs 
TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking UniGene IDs and 
accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand Nt_position 
404561 9795980 mm 69039-70100 
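
As an illustration of how a row of Table 24B can be read, the following minimal Python sketch splits a row into its four columns and computes the length of the predicted exon. The names `ExonRecord` and `parse_row` are hypothetical and not part of the original disclosure. The sketch assumes that Nt_position is a single start-end range with both ends inclusive, which the table does not state explicitly. The Strand token is kept exactly as printed.

```python
from typing import NamedTuple


class ExonRecord(NamedTuple):
    """One row of Table 24B (hypothetical container, for illustration only)."""
    pkey: int      # Eos probeset number
    ref: str       # sequence source, e.g. a Genbank Identifier (GI) number
    strand: str    # strand token, kept as printed in the table
    start: int     # first nucleotide of the predicted exon
    end: int       # last nucleotide of the predicted exon

    @property
    def length(self) -> int:
        # Assumption: both coordinates are inclusive.
        return self.end - self.start + 1


def parse_row(line: str) -> ExonRecord:
    """Split a whitespace-delimited 'Pkey Ref Strand Nt_position' row."""
    pkey, ref, strand, nt_position = line.split()
    start, end = (int(value) for value in nt_position.split("-"))
    return ExonRecord(int(pkey), ref, strand, start, end)


row = parse_row("404561 9795980 mm 69039-70100")
print(row.pkey, row.ref, row.start, row.end, row.length)
```

Keeping the coordinates as integers makes it straightforward to slice the referenced genomic sequence once it has been retrieved.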
Table 25 

The 69 gene sequences identified to be overexpressed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs in 
Genbank). The sequences may be used either to identify genes that encode known proteins, or to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:516-522). 
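
The relationship between each DNA sequence, its stated "Coding sequence" interval, and the paired protein sequence can be sketched in a few lines of Python. This is a minimal sketch, not the method used to produce the table: the helper names `clean_sequence` and `translate_cds` are hypothetical, the standard genetic code is assumed, and the coding interval is assumed to be 1-based and inclusive. Characters that are not unambiguous bases are kept, and any codon containing one is reported as `X`.

```python
import re

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(listing_text: str) -> str:
    """Remove whitespace and position counts from listing-style text."""
    return re.sub(r"[\s\d]", "", listing_text).upper()


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the interval start..end (assumed 1-based, inclusive).

    Translation stops at the first stop codon; codons with characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = dna[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 nucleotides of Seq ID NO: 1 as printed in the listing.
fragment = clean_sequence("ATGGAGCCCT CCTGG")
print(translate_cds(fragment, 1, 15))  # MEPSW, the start of Seq ID NO: 2
```

Reporting unknown codons as `X` rather than dropping them keeps the reading frame intact, so a damaged character in the listing affects only one residue of the translation.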

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

ATGGAGCCCT CCTGGCrrCA GGAACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
AGAGCCCTGC ACCTGTITCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 1 80 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTG ATGG AAAAATACCT ATGTGCTGTT 240 
CCCTTGTGGG TTGG ACOCTT TACG ATGTTC TTCAGTGTCC ATGACCCAGA CTATGCTAAG 300 
ATTCTCCTGA AAAGACAAGA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 360 
GTTGGTCGAG GACTTGTGAC CCTGGATGGT TCTA AATGGA AAAAGCACCG CCAGATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TGAGAGTGTT 480 
CGGATGATGC TGAACAAATG GGAGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATGAAGT GTGCCITCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCrGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAG AACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GGAAGGAGTC TCTTAAGG AT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGGACAT ACmTGAGT GCCAA A AGCG A AAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTG A AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCr CCTGGATCCr TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAGATGC 1020 
CGAGATGAA A TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1 140 
AACATATCCC GGTrACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACXTTGCA 1200 
GGA ATAACTG TGriTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT G AGATTCTCC AGGGAA AATT CTGAAAAAAT ACATCCCTAT 1 320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGGAAT CCATGTGTTT 1500 
GC AAA AAA AG TTTGCTAATT TTAAGTCCTT TCGTATAAGA ATTAATGAGA CAATTTTCCT 1560 
ACCAAAGG AA GAACAAAAGG ATAAATATAA T AC A AAATAT ATGTATATGG TTGTrTG AC A 1 620 
AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
TCTTTGCTGT ATCTGGTGAA ACCCACAAAA ACACCTGAAA AAACTCAAGC TGACTTCCAC 1740 
TGCGAAGGG A AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1 800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AA AA 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 
| | | | | | 

MEPSWLQELM AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 1 20 
VGRGLVTLDG SKWKKHRQIV KPGFNISILK IFITMMSESV RMMLNKWEER lAQNSRLELF 1 80 
QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SQGQIFSKFN QELHQFTEKV IQDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLQAEVKT FMFAGHDTTS SAISWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 
QMPYTTMCnC ECLRLYAPW NISRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKJHP Y AF IPFSAG LR NCIGQHFAII ECKVA VALTL LRFKLAPDHS 480 
RPPQPVRQVV LKSKNGIHVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM_052997 

Coding sequence: 100-4125 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

CTAGTCTATA CCAGCAACG A CTCCTACATC GTCCACTCTG GGG ATCTTAG AAAGATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG A TG ACAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAGA CGCCCAGAAG AGG ACTGCTC TACACTGGGC CTGTGTCAAT 1 80 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCITGAT 240 
GGCGAACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CGATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CIATTATGCTG TTTATAGTGA GATTTTGrCA GTGGTGGCAA AACrGCTGTC CCATGGTGCA 420 
GTCATCGA AG TGCACAACA A GGCTAGCCTC AC ACCACTTT TACTATCCAT AACG AAA AGA 480 
AGTGAGCAAA TTGTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGGAT CATCAGAGAT AGTTGGCATG 600 
CllCITCAGC AAAATGTTGA CGTCTTTGCT GCAGATATAT GTGG AGTAAC TGCAGAACAT 660 
TATGCTGITA CTTGTGGATT TCATCACATT CATGA ACAAA TTATGGAATA TATACGAAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA GAAGGAACAT CTGCAGGAAC ACCTGATGAG 780 
GCTGCACCXrr TGGCGG AAAG AACACCTGAC ACAGCTGAAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGA AC AGTCA GCAGAAGAAA CACCTAGGG A AATTACGAGT 1 020 

CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC 1080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCT AGGAAGA TCGCATGGG A GAAAAAAGA A 1 200 
ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 

AAAGGAAGAT CTAAGATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1320 

AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTCTCnTGAGAG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGIT CCAAATAAAG CCnTGAATT GAAGAATGAA 1560 
CAAACATTG A GAGCAG ATCC G ATGTTCCCA CCAG AATCCA AACAAAAGGA CTATGAAGAA 1 620 

AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 

AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAGA GTCTCCTAAT 1740 
AAAGATGGTC TTCTGAAGGC TACCTGCGGA ATGAAAGTTT CTATTOCAAC TAAAGCCTTA 1 800 
GAATTGAAGG ACATGCA AAC TTTCAAAGCG GAGCCTCCGG GGAAGCCATC TGCCTTCG AG 1 860 
CCTGOCACTG AAATGCAAAA GTCTGTCCC A AATAA AGCCT TGGAATTG AA A AATGAACAA 1 920 

ACATGGAGAG CAGATGAGAT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AGATAAAATA AATGG AAAAT TAGAAGGGTC TCCTGTTAA A 2 1 00 
GATGGTCTTCTGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 

TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 

GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATC AGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGA AGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAGA TAA AATAAAT GGAAAATTAG AAGAGTCTCC TG ATAATGAT 2460 
GGmrCTG A AGGCrcCCTG C AG AATG AAA GTTTCTATTC CAACTAAAGC CTTAG AATTG 2520 

ATGGACATGC AAACTTTCAA AGCAGAGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCA AATA AA GCCTTGGAAT TGAAGAATG A ACAAACATTG 2640 
AGAGCAG ATC AGATGTTCCC TTCAG AATCA AAACAAAAG A AGGTTG AAGA AAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 

ATCTTGGATA CAGTTCATTC TTGTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGG A ACAAATG AAA AAG A AGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAAATAAA ATCAC AGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTG AC TTTAA ACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGA ATTAGG A AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3 120 

45 GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACA AG ATATAGAATT G A AGAGTGTA 3 1 80 
GAAAGTAATT TGAATCAGGT TTCTCACACT CATG AAAATG AAAATTATCT CITACATGAA 3240 
AATTGCATGT TGAAAAAGGA AATTG CCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATnTAAA AGAAAAGAAT 3360 

_ GCTGAACrrC AGATG ACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 

5 0 TATAGTGGGC AGCITAAAGT TCTGATAGCT GAG AACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAGAGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCA AAGA AAAATGAATG TTGATGTGAG TAGTAGGATA 3660 
. TATAACAATGAGGTGCTCCATCAACCACTTTCTGAAGCTCAAAGGAAATCCAAAAGCCTA 3720 

55 AAAATTAATCTCAATTATGC AGGAGATGCT CTAAGAGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAA ACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CA ACAGCA AT TAGTTCATGC ACATAAGAAA 3960 
GCTGACAACA AAAGCAAGAT AACAATTG AT ATTCATTTTC TTGAGAGGAA AATGCAACAT 4020 

60 CATCTCCTAA AAGAGAAAAA TGAGG AGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CATGAGAGAC AAGCAGTAAG 4140 
AAACTTCTTT TGGAG AAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TG AA A ATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 

65 GAATrACTTGTTCACGAATTGCATAAAGCTGCACAGGATTCCCATCTACC CTGATGATGC 4380 
AGCAGACATC ATTCA ATCCA ACCAG AATCT CGCTCTGCAC TCCAGCCTAG GTGACAGAGT 4440 
GAGACrCCAC CTCGG AAA 

Seq ID NO: 4 Protein sequence: 

Protein Accession #: NP_443723.1 

1 11 21 31 41 51 
| | | | | | 
MTKRKKTINL NIQDAQKRTA LHWACVNGHE EVVTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANIU DSGADINLVD VYGNMALHYA VYSEILSVVA KLLSHGAVIE VHNKASLTPL 120 
LLSnXRSEQ IVEFLLIKNA NANA VNKYKC TALMLAVCHG SSEIVGMLLQ QNVDVFAADI 1 80 
CGVTAEHYAV TCGFHHIHEQ IMEYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKF TWA AKGRPRK 360 
lAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
DEEYSCDSRS LFESSAKIQV CiPESrYQKV MEINREVEEP PKKPSAFKPA lEMQNSVPNK 480 
AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKING 540 
KLEESPNKDG LLKATCGMKV SDPTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKKEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEIDKINGK 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
EIXNEQTLRA DEILPSESKQ KDYEESSWDS ESLCHTVSQK DVCLPKATHQ KEIDKINGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK PSAFEPAIEM QKSVPNKALE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQK EMDKISGKLE 90O 
DSTSLSKJLD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEIKSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNADILNE KIREEIjGRIE EQHRKELEVK QQLEQALRIQ 1020 
DIELKSVESN LNQVSHTHEN ENYLLHENCM IXKHAMUCL EIATUCHQYQ EKENKYFEDI 1080 
KJLKEKNAEL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSKLKEKQ DKEILEAEIE 1140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSUON LNYAGDALRE NTLVSEHAQR DQRETQCXJMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSKmDIHF LERKMQHHLL KEKNEEIFNY 1320 
NNHLKNRI YQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTG AAAGG AA AGG A AG ATC A TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 1 80 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCITT TGCGAAGTGC AGGAAAATTA 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
TTGCGTTGTA AATGCCACCA CCATTGTCCA G AAGACTCAG TCA ACAATAT TTGCAGCACA 420 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 
GGTTGCCTAG GACTAG AAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACG AATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAA AACAG AGATI TI G'l I GATGGACCTA TACACCACAG GGCnTACTT 660 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 
TATAAAAGAC AAGAAACCAG ACCTCX3ATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 
ATTCCTCCTG GAG AATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAC GTCGCTATGG GGAAGTnTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTGAAAGTGTTCTTCACCAC AGAGGAAGCX: AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTITCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 140 
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC A AGCCAAACC AGCAATTGCC 1260 
CATCGAGATCTGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CrGTATTGCT 1320 
GACCTGGGCCTGGCTGTTAA AnTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 
AATC ACTTCC AGTCTTACAT C ATGGCTG AC ATGTATAGTT TTGGCCTC AT CCTTTGGGAG 1 500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC. 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCA AACCG GTGGAGCAGT GATGAGTGTC TA AGGCAGAT GGGAAAACTC 1680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA COGITGGGTA ACITGTTCAA GATATGATGC AT 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 
| | | | | | 

MIXRSAGKLN VGTKKEDGES TAPTPRPKVL RdCCHHHCPE DSVNNICSTD GYCFTMIEED 60 
DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCfERN ECNKDLHPTL PPLKNRDFVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 1 80 
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEK.VAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSSVSGLCHLHTEIFSTQGKPAIAHRDUCSKNILVKKNGTCaADLGLAVKFISD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 482-3007 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA GAAAGCAAGA GTG ATCA ATC TCAGCTCTGT TA AACATCTT GTTTACnTAC 1 20 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 1 80 
ATGCCGTG AA nTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 

15 CTCrcOTGAAATATmCAATGAAATGCTCAGTATTCTATCTTTCACCAGAG^^ 300 
CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGA ATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGA AGCTG TGG ATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

20 CCAAACTCCA GTGCTCTCAT CCAG AGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATA AG 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCrGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TGCAGATATT GAG ATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 

25 TCACAATTCr TTAGAAATTC TTAAAGAGGA TACTTTCTAT GGACTGGAAA ACCTGGAATT 900 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATITGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1 080 
TGTTGGTTTT CTCGAACACA TTGGCCG A AT ATTGG ATCTT CAGTTGG AGG ACAACAAATG 1140 

30 GGCCTGCA AT TGTG ACTTAT TGCAGTTAA A A ACTTGGTTG G AGAACATGC CTCCACAGTC 1 200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTr GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAA AACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1 440 

3 5 ATCCACTCA A CTTCCAGG AC CTT ACT GCCC T ATTCCrTGT AACTGCA AAG TCCTATCCCC 1 500 
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 
TCCTCCGCAAAATCCrAGAAAGCTCArrCTAGCGGGAAATATTATTCACAGTTTAATGAA 1620 
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 
. ^ AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAG ATTA CAAAAACTCT ATCTAAATGG 1740 

40 TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1 800 
ATATCTTGAA TACAATGCCA TTA AGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1 860 
ACTTAAAGTC CTGTATrTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTmC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CC AGATTGAC CTTGAGGATA ACCCCTGGG A 2040 

45 CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAG ATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGG AC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTG ATTA TGTTCATCAC 2340 

5 0 TATTGTTTTC TGTGCTGCAG GGAT AGTGGT TCTTGTTCTT CACCGCAGG A GAAGATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAG AGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TG A AAGACCC TCTGCCTCAC TCTATGAACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAG AAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

55 TCnTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 
GAACCAATCA ACAGAATTTT T ATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAA ACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

00 nTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AA ATAAAATC AATCCTTGCA TGTAAATC 




Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 
| | | | | | 

MKLWIHLFYS SLLAQSLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKM VSEIS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIHGAFNG LGLLKQLHIN 120 
HNSLEILKED TFHGLENLEF LQADNNFITV lEPSAFSKLN RLKVLILNDN AIESLPPNIF \ 80 
7 5 RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 
IIGD WCNSP PFFKGSILSR LKKESICPTP P VYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSILKLPTKA PGliPYITKP STQLPGPYCP IPCNCKVLSP SGLUHOQER NIESLSDLRP 360 
PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 
GVPLTKVNLK TNQFTHLP VS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 
TDDILCTSPQ HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TWADTILRS 600 
LTDAVPLSVL ILGLUMFIT IVFCAAGIW LVLHRRRRYK KKQVDEQMRD NSP VHLQYSM 660 
YGHKTTHHTTERPSASLYEQHMVSPMVHVYRSPSFGPKHLEEEEERNEKEGSDAIUIL 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELKLM EHfLMYSKPRK VLVEQrTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NM_003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGOGACGCA CGCACACACA CGGGGGGAAA 120 
CrrmTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCXiGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCG AGG CAGGAAATCC CTCOGGTCXjC GACGCCCGGC CCCGCTCGGC 240 
OCtXaOOTOGOATGOTGCAOCOCTCGCCGCCOGGCCJOGAQAGCTGCTGCAC^^ 300 
GOGACOATGG CAGCGCGCCC GCTGCCCGTG TCCOCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGG AAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCXiAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTQCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACOGTACTO A TOTCTCCCTC GCTCQAAATT ACACGOTAAT TCTGGGTCAC 660 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 
TCTGGTCTCA GGGG ACTTAT TGTGTTTG AA AATGAAAGCT ATGTCTTAGA ACCAATG AAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCG AAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 
CAGOGATTAA TAGAGATTGC TAATCACGTT GACAAGmT ACAGACCACT GAACATTCGG 1080 
ATCXjTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1 140 
CCATTCACCA GCCTCCATG A ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TOTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGG ATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGITGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACITGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCX3GGGGCCA GAAGTGTOGG AACAGATTTQ TGGAAGAAGQ AGAGG AGTGT 1 620 
GACTGTGGGQ AGCCAGAGGA ATGTATGAAT CXiCTQCTGCA ATGCCACCAC CTGTACCCrG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
AGCCCrCACT GCCCAGCCAA CXjTGTAOCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 
GGCTACTGCr ACAATGGCAT CTGOCAGACTCACGAGCAGC AOTGTOTCAC ACTCTGOGGA 1920 
CCAOGTGCTA AACCTGCCCC TGGGATCrGC TTTGAOAGAO TCAATTCTGC AGGTGATCCT 1950 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAG ATGCT 2040 
AAATGTGG AA AAATCCAGTG TCAAGGAGGT GCCAGCOGGC CAGTCA1TGG TACCAATGCC 2100 
GITTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTOTG CCGGGGGACC 2160 
CACX3TGTACT TGGGOG ATG A CATGCCX3G AC CCAGGGCTTO TGCITOCAGO CACAAAOTOT 2220 
GCAGATGGAA AAATCTGCCT G AATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGG AGG AAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAG A TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCrTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 
TTTACAAATA AGAAG ACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCXjGCCACCC 2580 
OGTGGCTTCC A ACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATG AGGAAG 2640 
COGCCAGATT CCTACX:CACC GAAGGACAAT CX:CAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTG AAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CrrCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTOGCCAG AACAACTCGG CTCACTCATG CCTTGGCrAO OACCCCAGGA 2940 
CAATOGGAGACTGGGCTCCG CCTGGCACCCCTCAGACCTG CTOCACAATA TCCACACXIAA 3000 
GTGCXX:AG AT CCACXrCACAC CGCCTATATT AAGIGAGAAG CCGACA CCTT TTTTCAACAG 3060 
TC AAG AC AG A AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
GGATTrmT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3 180 
GCTGTGCTGT GCTATGOTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
GCAOAATGTTGATTACAGTG CAGTGCGCTG TAGTAGGCATTTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATnTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGAITCT GGACAGGATG TGTTTGCnT CTGATCAAGG CCTTA1TGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTG AT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TCTGTTTGGC TTTCAGGG AG GCCCTQTGCC CCITGACAAC TGGCAGGCAG GCTCCCAGOG 3600 
ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AG AGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTG AG CTGACCAGCC GTGAGCATGT TTGGAAGGOO TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTG AT AGAAATGCCA AGCACTTCTT TTTCTCOCTG TCCTTTCTAG AGCACTGCCA 3840 
CCAGTA0G1T ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAG AC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACTGCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
TATAGTTCAT QTCTOCTATC ATTATTCOTA GATATTOGAC AAAO AACCTT CTCTATOGGQ 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCTGTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAG ATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCrrCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 
AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATQ CCTCTQTCTT 4500 
CTGGACTGGT TTTCACATTA G AAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATG AG AA AGCCTTCTTT TGGGGTCAAC AGTnTCCTA TGCTTTG AAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAG A AAACAAAACT GCATTTCACT TTCCXXSGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 
AAACACACAC AAAAGGG AAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTOT 4800 
TTATTCTATA GITATTAAGT TCTTTAAAATGTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTO TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAOOTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTCTCl l 1 1'l" 4980 
AG ATQCCCAA ATCCTTAO AT CrGGCATGTT AGCXCTTCCT CCAATTATAA GAGG ATATGA 5040 
ACCAAA AAAA AAAAAAAAAA AA 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NP_003465.2 

1 11 21 31 41 51 
| | | | | | 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWlPVKSFD 60 
SKNHPEV LNl RLQRESKELI INLERNEG LI ASSFTETHYL QDGTDVSL AR NYTVILGHCY 1 20 
YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRG9CG 180 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSG VY FOGITIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
ACR0SSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NOGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
lETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKJCLNRQCQ NISVFG VHEC 660 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF OFGGSTDSGP IRQADNQGLT IGILVTILCL 720 
LAAGF WYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLP AKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPOQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession #: NM_001394 

Coding sequence: 400-1584 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

GGCACGAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CG AAAAGAGT CCGCGGTCCT 120 
CTCrCGT AA A CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 1 80 
TOGGCGCCCG AGOCCOCTCC G ACTGCTATG TQACCGCOAQ GCTGCGGGAG GAAGGGGACA 240 
GGGAAG A AG A GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTrCTCITC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCGACCAT^GTG ACGAT GGAGGAGCTG 420 
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGG ACGAG AA TGGCGGCGGC 480 
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCrGCTG 540 
CTGG ACTGCA G ACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAG GTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TG AGCCTGG A GCAG ATCCTG 660 
CCCGCCG AGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC GCAGCCCGCG CGCOGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCOC TGCGCCGCAA OGCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 
G AGAGGTTTT CCTCCG AGTA CCCAG AATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCITGGACC TGGGCTGCAG CTOCTGTGGG 960 
ACOCCACTAC ACQACCAOOG GOGTCCTGTG G AGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGCTCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GG ACACTATC AGTACAAGTG CATCCCAGTG 11 40 
GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCOTGAAGO ACTOCCQTGG GCQCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGOGGTCG 1260 
GCCACCATCTGCCTGOCCTA CCTGATGATG AAGAAACGGG TOAGGCTGGA GGAGGCCITC 1320 
GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GG ACCCCTGC GGGAGCGGGG CAAG ACOCCX: GCCACCCCCA CCTCGCAGTT CGTCTTCAGC 1500 
TTTCCGGTCTCCGTGOOOGTGCACTCGGCCCCCAGCAOCCTOCCXn'ACCTOCACAOC^ 1560 
ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620
CAGCAAGGGT AGGACGGGCC GCATGCGGCA G AAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACXjA CGAGGCCAGC CAGAATGGCA 1740 
ATAAGG ACTC CXiAATACATA ATAAAAGCAA ACAGAACACT CCAACTTAG A GCAAT AACCO 1800 
GTGCCGCAGC AGCCAGGGAA G ACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCXSAT 1 860 

AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1920

CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG 1980 
AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTrGT 2040 
AGTTAAAGG A ATTTCATTTT TAA AAG AAAT CTTCX} A AGGT GTGGTTTTCA TTTCTCAGTC 2 1 00 
ACC A ACAG AT G AATAATTAT GCTTAATA AT AA AGT ATTTA TTAAG ACTTT CTTCAG AGTA 2 1 60 

TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGG AA GTAAATGGTO TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATG ATG AGCAACTTG A GTITGTTGCC A CTOAGA AGC AGGCGGGTTG 2340 
GGTGGG AGGA GGAAGAAAGG GAAGAATTAG GTTTG AATTG CTTTTTAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460

TGGGAGGAGQ AAGAAAGGGA AGAATTAGGT TTGAATTGCT TmTAAAAA AAAA

Seq ID NO: 12 Protein sequence:
Protein Accession #: NP_001385 

1 11 21 31 41 51

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIVYDE RSPRAESLRE 120
DSTVSLVVQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 180
LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240
QYKCIPVEDN HKADISSWFM EAIEYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 
VRLEEAFEFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

Seq ID NO: 13 DNA sequence

Nucleic Acid Accession #: none found

Coding sequence: 68-340 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51

AOCG'ccrrJjC cnCTcrrAG gctttgaagc atttttgtct gtgctcccig atcttcaggt 60

CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTOGTCTC 120 
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180
TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTOCAG mTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTOA GATGGAATCA GCTTG AGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTOC 420 
CTACGATATCCCCrTTATCTCTAATCAGTTTATTrTCTTTCAAATAAAAAATAACTATGA 480 

GCGAGCTAAC AT

Seq ID NO: 14 Protein sequence:
Protein Accession #: none found 



1 11 21 31 41 51 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP
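
The "Coding sequence: start-end" coordinates given for each DNA entry appear to be 1-based and inclusive, with the end position covering the stop codon (for example, 68-340 spans 273 nucleotides, i.e. 90 residues plus a stop). A minimal sketch of how such coordinates can be applied is shown below. The helper names (`clean_listing`, `extract_cds`, `translate`) and the toy sequence are hypothetical illustrations, not part of the listing, and the standard genetic code is assumed.

```python
import re

# Standard genetic code, laid out in the conventional TCAG ordering (assumed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean_listing(text):
    """Drop spacing and position counters, keeping only sequence letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()

def extract_cds(seq, start, end):
    """Return the coding region for 1-based, inclusive coordinates."""
    return seq[start - 1:end]

def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Toy example (hypothetical sequence, not taken from the listing).
toy = clean_listing("CCATGAAGTT CTAGGG 16")
cds = extract_cds(toy, 3, 14)
print(cds, translate(cds))
```

Codons containing characters outside A, C, G and T (as can occur in scanned text) are reported as `X` rather than guessed.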



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM_016640.2

Coding sequence: 39-1358 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

GCTTAAGTTG ACCTCTGGGT CCGGAATCGC GGGCAAAGATGGCGGOGGCC AGGTGTTGGA 60 
GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120
CTACAGAAAC GACCTCCCAA G ACGTCGCGG CG ACCCCCGT CGCGOGGTAC CCGCCGATTG 1 80 
TGGCCTCCAT G ACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGG ATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGOGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATG AAGTA CATGGTTTAC CCGCAG ACCT TCGCACTGAA TGCCG ACOGC TGGTACCAGT 360 
ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420
AGCCCG AACC COAACCTGAA CCTGCGCTQG ACCTCGCGGC GCTGCGTGOG GTCGCCTOCG 480 
ACTGCCTGCT GCAOG AGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CCrrACOAOO 540 
AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGOCCTCC 600
TCAGCrCACA CAACCCGGCC CTGGCX:GCrG CCGOXTOGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CG AATTGATG 720 
ACTTGCG ATA CCAGATAGAT G ATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
CAG ACA AACT TCC ATTATTC AAACGGCAGT ATG AAAACCA CATATTTGIT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACA(XC AGTTrCATCT GTTACCTGAC AAATTAAGAA 960 
GGG AAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTmT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 
AAGCAGATGT TACTCG ACCT TTTGTCTCCX: AGGCTGTGAT CACAGATGGA AAATACmT 1 140 

ccrmrrcTG ctaccagcta aatactttgg cactgactac acaagctgat caaa ataacc 1200 

CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 
TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATnTTQ AGACAAATAT TrCTTATGTC AACCTGTTAT TAG ATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTr ACCmTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCOCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 
AAAAGAACCC ACTGC 



Seq ID NO: 16 Protein sequence:
Protein Accession #: NP_057724.1

1 11 21 31 41 51 
| | | | | |

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 
PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180 
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN HIFVGSICTAD PCCYOHTQFH 300 
aPDKLRRER LLRQNCADQI EVVFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 
rrDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 
IVHFLLNRPK EEKSQLLEN 

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NM_025059.1

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 
| | | | | |

GCAIfiAGCCT GGACTGCACC AGCCATATCG CGCTOGGTGC CGCTTCGCCA GCGCCCGAGG 60 
AAACTTACGA TCATCT1TCX3 GAAGTCCXX3G TCAOGOSGOA GCAG1TAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCG A AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 1 80 
AGTCTGAGCT TCAAG ACCTC CGATCCAAOA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTG A AATGG AG AGC TACAAGG AAA ACAATQCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAGA CAGAGTTCAG GAACTAQAAO AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAG AAGAAAG 420 
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480 
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540 
ATCCAGATGA G AGG AATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATGAATTC GTGAAAGG AC AAATrOTFAT TCTTOAAG AG ACTATAAATG 660 
TCCATGAGAT GG AAGCAAAA GCTAGCAG AG AAAOGATCAT GAGGCTGGCT TCAGA AGTCA 720 
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAG AA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAGAAGCA AAAG AAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTG AAG A 900 
AAAGCTCTTC TG AGTTGG AG AAGAGTTTG A AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 
AAAGCCAGTA CTCCTCATTT AGGG AGAAAA TCGCAGCCCT CCTTAGGGGC AG ATTGAGCA 1 020 
TGACTGGGTC CACTGAGGAC ACCATTTrGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGOCCAAAT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 
TTTTGCGAG A CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG G ATCAGCTTT 1320 
CTCAG AAAAT GAAGTTGGAC CAGATGGCTG CCGAACITGQ CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAOAG CAGCTGGTTC GTCTTG AG AG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTO CAGAGAAAGCTAAAGACACA GAAAGAGAGA CTGGAGAGCA 1500 
AAGAATTACA CATG AGCCTC CTCCGGCAGA AAATAGCCCA GCTGGAGGAG GAGAAGCAGG 1560 
CACGCACGGC ClTGGTGGTr GAG AGGG ACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACAOGTGTCO AGACTTOCAC ACCGAGCTCA 1680 
AAGCCAAACr GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAQCCA 1740 
TTG AAGATCT AAACAAATCC AGAGACCAAC TGGAG AAG AT G AAGGAG AAA GCTG AGAAAA 1 800 
AGCTCATGTC TGTCAAGTCA G AACTGGATA CCACAGAACA TGAGGCTAAG GAG AAT AAAG 1 860 
AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAOAGA GAAAAGCAGC TGGCAGACTT CAQGGAOGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TrCCTGATTA TGAAATCATC A AOTGTCTTO 2040 
AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100
CTACTGGGCA AGAGAGGCAC CCACA AGGCC ATTTACAGCT TCTTCATTGA ACACTGTATC 21 60 
TCTTGAGAG A GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATGATATAGT GAGAATGCAT 2280 
5 CACTTGCAAA AACG ATCTCA AAAQTGTCAO CCTTAGATAA ACOTCAGCAT TAAAAAAOGC 2340 
CAAAAAAAAAAAAAAAAAGCArrrTAGGATCCAGAAGAATTCCACCAGATTGCATGAGTT 2400 
AGATTGGGAA ATGGGAGTGG GAGATAATATTGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAG AAG CTGTGACATT TTTA ATTTAC 2520 

, _ AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAG1TGG GTAGTGGTGG 2580 

1 0 G ACCAGG ATG GACAACTCAT TQGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AOAATCTGTT TCTOCTGAAT CTCTOTG ATG CTGGTGGGAA 1TGTTTGCAT AG AGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTG AGACC CTGTrTCTAC TAAA AATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCX:AG G AGACGGAGG 2940 
CGGCAGTGAG CCAAG ATTGT GCCACTGCAC TCCAGCCTGG GCOACAGAGTGAAACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGG ATCA 3120 
AACTT ACAT A GGTCTC AGGT CCTGTA AG AA ACTTGCCTGT TCTAACTGTT GCTACCAG AT 3 1 80 

20 TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATAAG AT GAAGA AATCC TATA TAGTC T TATTCACCAA 3300 
TATATCCAAT ACAOCCACAG CAATGGTACC TTTTTAAG AT CAGGATTTTA TTATGAATTC 3360 
CTGTCACTTT CTOTnTCCA TTTAAATTTC TATTTTACA A ATTTTTCAGG GAATCATATT 3420 
CTTAACTTCA CTGAGAAAAA TGTGTTACTC nTTGG ACAA TTTATCTTAT TTCTATCATA 3480 

25 TAAGATGTAT TTTrTTATTG TCXTTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
AAGAAATAOA TCTGGTGCAC CCGAACATTA GGAG AAAATG AAAAATATAC AACCAACCXiT 3600 
TOGTG AOTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC COTCTCTGCA 3660 
AAAAATACAA AAATGAGCOC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCX3AGAG 3720 
GCTGAG ACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CG AGATOSCG 3780 

3 0 CTACTGCACT CCAGCCTGGG CAACAG AGAG AG ACCTTGTC TCAAAAAACA ACAACAACA A 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAG ACYAA AAGAAATTAA 3900 
AG AGATG A AA CAATCAATTG CAAGGGTCAA AACTAG ATTG GATCTTGGTT TG AATG AAAA 3960 
A AAGCATAAA ATATTCTTG C AAT AATTGTA AAAATTTG AA TGTGGACTAA GTCCT AOATT 4020 

_ _ ATATTAAA AT ATTTTTAATT TTTTAAG CTT GACAAATGCA CTG ATTGTTA TACTTTAAAT 4080 

35 AACTAAAAATCTGAGAATCCACAGTGCTACAGACAATAAATGATAAAATGGGAAAAAAAA 4140 
AAAAAAAAAA A 



Seq ID NO: 18 Protein sequence:
Protein Accession #: NP_079335.1

1 11 21 31 41 51 
| | | | | |

MSUXr^SHULGAASPAPEETYDHLSEWVTREQL^nmWVAQNARSELAATLVKFECAQ 60 
45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120 
RTEITAHAAI KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDOLD 1 80 
PDERNDKASD EDUUCLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EBKEK.LNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLKASQDAVTI^SQYSSFREKIAALUlGRI^MTGSTEDTILEKimdDSREE 360 
5 0 SRDRMVSQLE AQISELVEQL GKESOFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRDNLNFEKQ KYLKFLDQLS QKMKLDQM AA ELGFDTRLDV VLARTEQLVR LESNAVIENK 480 
TIAHNLQRKL KTQKERLESK ELHMSLLRQK lAQLEEEKQA RTALVVERDN AHLTIRNLQK 540 
K VERLQKELN TCRDLHTELK AKLADTNELK KTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600 
U4SVKSELDTTEHEAKENKERARNMffiVVTSEMKTLKKSLEEAEKREKQLADFREVVSQM 660 
5 5 LGLNVTSLAL PDYEIIKCLE RLVHSHQHHF VTCACLKD VT TGQERHPQGH LQLLH 



Seq ID NO: 19 DNA sequence

Nucleic Acid Accession #: AF071552, NM_000662
Coding sequence: 441-1313 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTA AAAGGGAAAT TGAGTGGGTC AGGTACCAC G GAT ACTATAC 60 
ACTCTATTGC ATGATTCTCCTGCCTACATC AGAAGACGTTTATAAGCCTA TTTTAAAGGA 120 
TACCAGTTGG AATCTCTCTTTTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 180 
TTTG ACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTG AAAGAAG ACATAATACA 240 
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTT ATGTG TGTAAAAGG A ATTCATACAA 300 

70 TG AAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATmTACAT GTTTA AAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT OTAAA AOTAA AATOATTTGC TTTCGTTrTG 420 
TTTTCCTTGC TTAGGGGATC ATGG ACATTO AAGCATATCT TGAAAGAATT GGCTATAAOA 480 
AGTCTAGGAA CAAATTGGAC TTGGA AACAT TAACTG ACAT TCTTCAACAC CAGATCCGAG 540 
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGG ATGC CATGGACTTA GGCTTAGAGG 600 

75 CCATmTG A TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTOCAG GTCAATCATC 660 
TTCTCTACTG GGCTCTGACC ACTATTGGTT TTG AO ACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATGATTCA CCTTCTCCrG CAGGTGACCA 780 
TTGATGGCAG GAACTACATT GTCG ATGCTG GGTTTGGACXj CTCATACCAG ATGTGGCAGC 840
CTCrGGAGTT AATTTCTGGG AAGG ATCAGC CTCAGGTGCC TTGrGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATQ AAO 960 
AATllCMCA TTCTG ATCTC CTAG AAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
1TAAGCCTCG AACAATTGAA G ATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1 080 
CATCTGTGTT TACTAGTAAA TC ATTTTGTT CXTTTGCAGAC CX:CAG ATGGG GTTCACTG1T 1 1 40 
TGGTGGGCTT CACCCTCACC CATAGG AGAT TCAATTATAX GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTCCAGAG AAAGCTTGTG CCCAAACATG GTG ATAG ATT TTTTACTATT TAGA ATAAGG 1320 
AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 1380 
TGTATCITCT GTACCCTTAC CTTATTTTG a AGAAAATCCT AGACATCAAA TCATTTCACC 1 440 
TATAAAAATG TCATCATATA TAATTAAACA GCTTTTTAAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
tggaaattgg tg atttatgc tag AAAGCTT TTAATGTTGG TTTATTOTTG AATTC 

Seq ID NO: 20 Protein sequence:
Protein Accession #: NP_000653.1



1 11 21 31 41 51 
| | | | | |

MDIEAYLERI G YKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL GLEAIFDQVV 60 
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 120 
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTEENGFWY LDQIRREQYl PNEEFLHSDL 1 80 
LEDSKYRKIY SFTLKPRTIB DFESMNTYLQ TSPSSVFTSK SFCSU2TPD0 VHCLVGFTLT 240 
HRRFNYKDNfT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRFFTI 

Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NM_003714

Coding sequence: 123-1031 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 
| | | | | |

CGGCAOGAGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CXiGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGG ATCCA GGTCTCCCGA OGGGAGGTTA ATACCAAGAA 120 
CQAlSrGTGC CGAGCGGCTG GGCCAGTrCA TGACCCrGGCTTrGGIOTTG GCCACCTTTG 180 
ACCOSGOGCG GGGG ACCG AC GCCACCAACC CACCXX3AGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG OCGCCTGTCC CTGCAGAATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300 
CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTITOGAGAA CAACTCITGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAQACGCC TTGAAATGTA AGOOCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG QAAGTGCCCG GCCATCAGGG AAATGGTGTC CX:AGTTGCAG CGGGAATGCT 540 
ACCTCAAGCA CG ACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAG ATG A 600 
TCCATTTCAA GGACTTGCTG CTGCACGAAC CCTAOGTGGA CCTCGTGAAC TTGCTGCTGA 660 
CCTGTGGGGA GG AGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTOGGGAAG CXrTGTGCrcC ATCTTG AOCT TCTGCACCTC GGOCATOCAG AAGCCTCCCA 780 
CGGCGOCCCC CG AGCGCCAG CCCCAGGTGG ACAG AACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCG A GGTGCCAAGG 900 
GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCOG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACG A ACAGTCTGAG TATTCTGATA 1020 
TOCGGAGGTGAAATG AAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC AnTACCATt AGAGAGGGGG GATGTCACAC GCAGGATTCT 1140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGG AGAC 1200 
CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1260 
GAGCGCGCCC GCGrTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGG AG TCAGTGGGTG 1 320 
TCGGCCGCrC TGTTGTGGGG G AGGTGAACC AGGGAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCTCCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 
AAATATCGCT TAOAATGCAO GAGAAGGGTG GAGAGOAGGC AGGGGCCOAG GGGGTGCTTO 1560 
GTGOCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAGAGCT CrrGGCOAGG 1 620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTGTACTG TCTGGGCTGG 1 680 
GGGGGACACT GTCCAAGGGA GTGGCCCCTA TGAGTITATA TnTAACCAC TGCTTCAAAT 1740 
CTCG ATTTCA CllinilAT TTATCCAGTT ATATCTACAT ATCTGTCATC T AAATA AATG 1 800 
GcrrrcAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_003705 



1 11 21 31 41 51
| | | | | |

MCAERUjQFM TLALVLATFD P ARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFDCDALKCK AHALRHRFGC 1 20 
ISRKCPAIRE M VSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 1 80 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHIPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 

Seq ID NO: 23 DNA sequence
Nucleic Acid Accession #: NM_005264.1

Coding sequence: 557-1954 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

| | | | | |

GAATTCCGGC CAG AAGAAAT CTGGCCTCGQ AACACGCCAT TCTCOGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCXXjAGC CGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 120 
CAACrCGGCC CTTCGAGCTC TCG AAGATTA CCGCATCTAT TTTnTTTTC I IlIilll CT 1 80 
TTTCCTAGCG CAGATAAAGT GAGCCOGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 

15 CTGAAAGAATAAATAAGTAAATAAACAAACTGGCTCCTCGCXGCAGCTGQACGCGGTCGG 300 
TTG AGTCCAG GTTGGGTCGG ACCTGAACCX: CTAAAAGCGG AACOSCCTCC 0GCCCTCX3CC 360 
ATCCCGG AGC TG AGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCOGG AGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGOGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 

_ ^ CTGAGCTCCC TCGGCAAG AC CCAGCGGCGG CTCGGG ATTT TTTTGGGGGG GCGGGGACCA 540 

20 GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTOTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCGAAGTG A GCGGOGGAGA COGCCTGGAT TGCOTGAAAG CCAGTGATCA 660 
GTGCCTGAAG G AGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 

25 GOAO AAOAAC TGCCTGCGCA TTTACrQGAG CATGTACCAG AGCCTGCAGO G AAATG ATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAG ATGTTT TTCAGCAAGT GG AGCACATT CCCAAAGGG A ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 

^ ^ C ACCCCGTGC ACC ACCAGCG TGTCCAACG A TGTCTGCAAC CGCCGC AAGT GCCACAAGGC 1 140 

30 CCTCCGGCAG TTCTTTG ACA AGGTCCCGGC CAAGCACAGC TACGG AATG C TCTTCTGCTC 1200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAG ACC ATCGTGCCTG TGTGCTCCTA 1260 
TG AAGAGAGO OAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CG AATTACAT 1320 
CTGCAGATCr CGCCTTGCGG A H Til ' MA C CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACTACGCTGACTGCCTCCTCGCCTACTCGGGGCTTATTGGCAC 1440 

35 AGTCATG ACC CCC AACTACA TAG ACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTG ACTG 1 500 
CAGCAACAGT GGGAACGACC TAGAAG AGTG CTTGAAATTT TTG AATTTCT TCAAGG ACAA 1560 
TACATGTCrr AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAQA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
- ^ CAAGCCCCTG GGGCC AGCAG GGTCTGAG AA TG AAATTCCC ACTCATGITT TGCCACCGTG 1 740 

40 TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT OGGTGCTTCC AGCCACATAA CCACAAAATC i860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGG AC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTT GTATAGC TGAAATTCCA 2040 

45 GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100 
AAGAAAGCTT CTTGTGATCC TTOGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGG A TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTT GGGC TGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATGATGATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 

50 TATCTCCTTT AATG ACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CrrmTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACrC TTGTACAAAA GAAAAAACCA COCGGAATTC 

Seq ID NO: 24 Protein sequence:

Protein Accession #: NP_005255 

1 11 21 31 41 51 

60 MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD JFRV VPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKJCY RSAYITPCTT 1 80 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
_ PNCLNLQDSCKTNYICRSRLADFFTNCQPESRSVSSCLKENYADCLLAYSGLIGTVMTPN 300 

65 YroSSSLSVAPWaJCSNSGNDLEEOLKFLNFFKDNTCLKNAIQAFGNGSDVTVWQPAFPV 360 
QmATmALRVKNKPLGPAGSENEIPTHVU»PCANLQAQKUCSNVSGNTHIX:i^ 420 
EKEGLO ASSH ITTKSM AAPP SCGLSPLLVL WTALSTLLS LTETS 

Seq ID NO: 25 DNA sequence

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1-1576 (entire sequence represents open reading frame)

1 11 21 31 41 51 

75 CTTTTGnTC GCCATGCCTA GTCTAGTGGT ATCTGOAATA ATGGAAAGAA ATGGGGGCTT 60 

TOO AG AACTA GO ATGTITCG GGGGAAGCQC TAAGGACCG A GGGCTGCTGG AAGAOQ AGCG 1 20 
COCCCTTCAG CTGGCTCTCa ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 
CAGOOCGGGC GAGGACGGOG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCOCC 240
CCCGCAGCCQ GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCOCCGA CGACGGCCCC 300 
CXjCAGCGCAG ACGCCCCAGC CCCCCACOGC CCCCAAAGGG GCG AGCGACG CCAAGCTCTO 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGG AGTGTOT 420 
5 TCCCOTOCCC ACCTCCGAGC ACGTOGCOOA GATCGTOOOC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAOO GCCAAGACCA ACACCTACAT CAAGACA(XG GTQ AGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCAT CTCAGC 600 
AGCXjGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 

1 0 GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCX:CAGGCAA 840 
CGTGGAGCGT GCGCGCOAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CG AGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACrcCG AC GCCTGGCGGG TGCACCAGOC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 

1 5 GAACAGCCTG GGCTGCATCG GCXiAGTGCGG AGTGG ACTCT GGCTTTG AGG CCCCACGCCT 1 080 
GGGTGAGCAO GGCXjGGGACT 1TGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1 140 
CAAGCAGGATGTOTACTACG GCXiTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCO TGCrCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCXjC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCXj CGGGACCCGA 1320 

20 GCTGGCCGG A CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CT AAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGOGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTG ACTGCC GCCCTTGTGC CCTGCGOACA CAACCTGTTC TGCATGOAGT GTGCAGTACG 1500 
CATCTGCG AG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CX3CAAGCCAT 1560 

2^ CCGAATATTCTCCTAA 

Seq ID NO: 26 Protein sequence:

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51
| | | | | |

FCFAMPSLV V SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLQLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHV AE IVG RQGCKIK ALRAKTWYI KTPVRGEEPV 1 80 

35 FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRWG 240 
LVVGPKGATI KRIQQQTNTY IITPSRDRDP VFEITG APGN VERAREEIET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG KQDVYYG VAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GOGLRSPGGG RDCMVCFESE 480 

40 VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons)


1 11 21 31 41 51 

| | | | | |

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC OCCAGCGCCC CrCTGTGGGC GTCCAGTCCT TQ AGGCCGCA GAGCCCGCAG 120 

50 CTCAGGCAGAGCGACCCGCAOAAACGGAACCTGGACCTGGAGAAAAGCCTGCAGTTCCTG 180 
CAGCAGCAGC ACTCGG AGAT GCTGGCCAAG CTCCATGAGG AGATCG AGCA TCTG AAGCGG 240 
GAAAACAAGG GTG AGCCGGC GCGGGGOCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 

_^ GGGGGAACACAGOACGGGGAOCCCCrCCAG ACTGTCCTTGCCCACCTGGCTGCACTGGCC 420 

5 5 CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GG ACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACG ATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTG AAATGGG AAGGAACCCC TGGG ACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

60 ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAA A GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CC AGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCrCACTTCC CATTATCTTT GGGGCTGGGG CTG ACATCAG G AGG ACATCT G ACTGGTGGA 960 
TGGAGCCAGC CTGGG AACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGG A 1020 

65 GACATGGAGAAGGGGGTTGAGGGAGGGCCCTTCCCTAGCCGCTGTGGCAACTCCAGTGAG 1080 
CTGTTCTGGO CAAAGTGTGG CCCAAGTCXK3 CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1 140 
GACAGGACAC GGGAAOAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCOCAAG 1200 
CCCTCCTGCT TTCCAG ATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

70 AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGQ GCGTCTTGCG 1380 
GGCGOTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAO 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCrrCA ACA AGCAAGATTC AAAAGCTGAC GTCTCCCAG A AGGCGG ACCT GGAAGAGGAG 1 560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

7 5 G AGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GG AACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCG A AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 
ACCCAAOAGC TGCGOCACCT CAAOTCCCTC CTGGAAGOG A GCCAG AGGCC CCAGGCAGCC 1860
COSGAOGAAO CTAGCnTCC CAGOGACCAA GAAGCCACXjC ATTTCCCXrAA GGTCTCCACC 1920 
AAGAGCCrCT CCAAGAAATG CCTGAGCCCA CCTGTGGOGG AGOGTGCCAT CCTGCCOGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAG AGGCAG A AGAGGCTGCA GGCAATGCAG 2040 
5 AAACGGCX3CC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 Protein sequence:

Protein Accession #: FGENESH predicted


1 11 21 31 41 51 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMIAK LHEEIEHLKR ENKGEPARGP RP ALPPQAHS TLPLPQHRNT AINSSTRLGS 1 20 

1 5 GGTQDGEPLQ TVLAHLAAL A P VCQPSG YRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 1 80 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGO WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 

20 GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKABASNAGA ACMONSQHQO RQMG AG AHPP ^4ILPLPLRKP TTLRQCEVU RELWNTNLLQ 600 
TQEIMUCSLLEGSQRPQAAFEEASFPRDQEATHFPKVSTKSLSKKCLSPPVAERAII^ 660 

2 J LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence

Nucleic Acid Accession #: NM_012319.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons)


1 11 21 31 41 51 
| | | | | |

CTCXjTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC OGTGTGG AAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 1 20 

35 GCGGAGACXjA AGGCGCAATS GCGAGG AAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCX: 1 80 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCX:CX:CAG ACCACTGAG a 240 
AAATTAGTCC GAATTGGG AA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTO AAGGOT 360 
TCAGAAAATTACrrCAAAATATAGGCATAGATAAGATTAAAAGAATCCATATACACCATG 420 

40 ACCACGACCATCACTCAGACCACGAGCATCACTCAGACCATGAGCCTCACTCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCXXAG A CCATGACTCA GATAGITCAG 600 
GTAAAOATCC TAG AAACAGC CAGGGG AAAG G AGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAQQAATQT CAAGG ACAOT GTTAGTGCTA GTG AAGTGAC CrCAACTGTO TACAACACTG 720 

45 TCTCTG AAGG AACTCACTTT CTAGAG ACAA TAG AGACTCC AAOAOCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTGC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GA ATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGG AGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCATCCAGGTTC(XjCIXjAATGCAACAGAGTTCAACTATCTCTGTCCAGCCATCA 1020 

50 TCAACCAAATTGATGCTAGATCTTGTCTGATTCATACAAOTGAAAAGAAGGCTGAAATCC 1080 
CrCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1 1 40 
TCAGTTTCCT GTCTCTGCTQ GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGT TTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTIGAGTGGT GATGCITTTT 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAO TCATAGCCAT QAAOAACCAG 1320 

55 caatggaaat gaaaagagga ccAcrmrcA gtcatctotc ttctcaaaac atagaagaaa nso 

GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTOA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AGAAGAAACC TG AAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACmrC AACAA ATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

6 0 GAGCAG ACrC ACA AG AGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGG AAG AAG 1 680 
AAGAGGTCAT GATAGCrCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1 800 
TTCACCACCA TCATG ACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1 860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

65 TGGCCTGGAT GGTG ATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAA1TG 1980 

OTOcrGcrrr tactg aaggc ttatca agto gtttaagtac TTcroTTGcr gtgttctotc 2040 

ATGAGTTGCC TCATG AATTA GGTGACTTTO CTGTTCTACT AAAGGCTGGC ATGACOGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2 1 60 
GAAnrrCAT TGGTCATTAT GCTGAA AATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280
GTCACCATGG ATGTAGCCGC TGGGGGTATT TCmTTACA OAATO CTOGG ATGCmTOO 2340 
GnTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCXjTGTTT CGTATAAATT 2400 
T CTAGT TAAG GTTTAAATGC tag AGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTT AGTGGG T TTTGTGATTT 2520 

TTGTA1TGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTAOGnTTA ATATTTAAGT 2580
TATTCTATCT TCG AG ATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTA TTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTO TATOTTTCAO GGAAAAATGT CTTTAATOCT 2700 
ITTTCAAGAA CTAACACAGT TATTCCTATA CrGGATTTTA GGTCTCTGAA GAACTGCTGG 2760
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA G AAAGCTTAT ACTGAATTTA 2820 
AGCAAAG AAA TA AAGGAG AA AAGAGAAGAA TCTG AGAATT GGGGAGGCAT AGATTCTTAT 2880 
AAAAATCACA AAAnTGTTG TAAATTAGAG GGG AG AAATT TAGAATTAAG TATAAAAAGG 2940 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTOTCA GG ATTATTTC CCOTAAAA AC 3000 
GTAGTGAOCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAG AAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTrCTGGTT ACCTGGTTTA 3240 
CAAAATTATC AGAOTAGTAA AACITTGATATATATGAGQA TATTAAAACT ACACTAAGTA 3300 
TCATTTOATT CGATTCAGAA AGTACITTG A TATCTCTCAO TGCTTCAGTO CTATCATTGT 3360 
G AGCAATTGT CTTTATATAC GGTACTGTAG OCATACTAGG CCTOTCTGTG GCATTCTCTA 3420 
G ATGITTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT Q 

Seq ID NO: 30 Protein sequence:
Protein Accession #: NP_036451.2

1 11 21 31 41 51
| | | | | |

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 
FYRYGENNSL SVEGFRKLLQ NIGIDKDCRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF

Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: NM_002184.1

Coding sequence: 256-3012 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGITCG TGCGCTGTGG 120 
AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 
TAGAAGTAG A AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAAnT TGTGATTTGG 240 
AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTrTATTTTC 300 
CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
GATrATTTTC ATGTAAATGC TAATTACATT GTCTGG AAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 
TATGGAATC A CAATAATTTC AGGCTTGCCT CCAGAAA AAC CTAAA AATTT GAGTTGCATT 660 
GTGAACGAGG GG AAG AAAAT G AGGTGTGAG TGGGATGGTG G AAGGG AAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TG AATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGITGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG A AGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTG AT 900 
CCTGTATATA AAGTGAAGCC CAATCCGCCA CATA ATTTAT CAGTGATCAA CTCAGAGGAA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 
G ACAC AGC AT CCACCCG ATC TTCATTCACT GTCC AAG ACC TTAAACCnT TACAGAAT AT 1 1 40 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 
GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCTTTTGAAG CCAATGGAAA AATCTTGGATTATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATITACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCA A CCCTAACAGT AAGAAATCTT OTTGGCAAAT CAGATGCAGC TGTTTTAACT 1 500 
ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGG AAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 
CAAGCTCCAC CTTCCA AAGG ACCTACTGTT CGG ACAAA AA AAGTAGGGAA AAACGAAGCT 1 860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA AIGTGGATTC TTCCCACACA 1980 
G AATATACAT TGTCCTCTTT G ACTAGTG AC ACATTG TACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TrCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 
GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTA ATTAAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAG AGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGCCAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCTACCXjT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCXTAA 2580 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCC AGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAGTCAGCA TGA ATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GG AAG ATTTT GTTAG ACTTA AACAGCAGAT TTCAGATCAT 2820 
A TTTCA CAAT CCTGTGGATC TGGGCAAATG AAAATGTITC AGGAAGTTrC TGCAGCAGAT 2880 
GCrnTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGITGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACTACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAGTGAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGATTTTATCTGTG AATTC 

Seq ID NO: 32 Protein sequence:
Protein Accession #: NP_002175.1

1 11 21 31 41 51
| | | | | |

MLTLQTWWQ ALFIFLTTES TGELLDPCGY ISPESPWQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNn-TFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGG R ETHLETNFTL KSEWATHKFA DCKAKRDTPT 1 80 
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSDC SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFRIR 300 
CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQ^fY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPaTDW QQEDGTVHRT 480 
YLRGNLAESK CYLITVTPVY ADGPGSPESI KA YLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF IRNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IVVPVCLAFL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSV VEIEAND KKPFPEOLKS LDLFKKEKIN 720 
TEGHSSGIGG SSOkiSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 840 
NEEDFVRUCQ QISDHISQSC GSGQMKMFQE VSAADAFGPG lEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence

Nucleic Acid Accession #: NM_018255.1

Coding sequence: 11-2491 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

| | | | | |

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60
GGTGCGGGGA GTCCTG AACT GG AGCTCTGG G(XCAG AGGA CTTCTGGCCT TTGGC ACGTC 1 20 
CTGCTCCGTG GTGCTCTATG ACTCOCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 180 
CACCGCCCGA GTCAATTGCA TAC AGTGGAT TTGTAAACAG G ATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCr GGAGGATCTG ATAATCAAGT G ATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTrC AAGGCCATG A AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATC AGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCrGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CX:AAATACTG ATGTACCAAT 540 
A1TAGCATGT GGCAATG ATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAG AAAGTG CnTCTCTCT GTGGACATG A GGATTGGATr AGAGGAGTGG AATGGGCAGC 660 
CrrrGGTAGA GATCTTTTCC TAGCAAGCTG TTCACAAGAT TGCCTOATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAA AC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATG AA AGTGT TAAAATAGCA TTTGCTGTrA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTGAG ATT ATTATCTGCT TCCATGGATA AAACCATG AT 960 
TCTCTGGGCTCCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGG AAT ACTTTGGGAT TTTATG ATTG CC AGTTC AAT GAAGATGGCT CCATGATCAT 1 080 
TGCTCATGCT TTCCACG GAG CGTTGCACCT TTGGAA ACAG AATACAGTTA ACCCAAG AG A 1 1 40 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTG ATCAG ACAACTAGAC nTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TFGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATG AT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGG ACAA TCACTG AATC ATGTGCTCTG TAATCAAG AT AGTGATCTTC CAG AAGGAGC 1 500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCTTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTT ACTG AGCCTCCCAC TG AGGATCAT CTTCTGCAGA ATACTTTGTG 1 680 
GCCTGAAGTT CAAAA ACTAT ATGGGCACGG TTATGAAATA mrrGTGTTA CTTGTAACAG 1740 
TTCAAAG ACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1 800 
TCnTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAG ATG GCCTTCTCAC CTAATGAG AA GTTCTTACTA GCTGTrrCCA GAGATCGAAC 1920 
CTGG TCATTG TGGAAAAAGC AGG ATACAAT CTCACCTG AG TTCGAGCCAG TTrTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGCT CTTGTG ATTG 2040 
GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100
TGAGTGCGAC TCCACTGATG ACTGTATTG A GCACAACATT GGCCCCTGCT CCTCAGTCCT 2 1 60 
GGACX3TGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGC(XA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGGAT TGG AGTGTGG AA AGATTTGC TTATATACCT GGAAAAAG AC 22B0 
TGATCAAGTTCCAGAAATAA ATGACTGGAC CXTACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAG AA 2400 
GGAAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGA AGATC ACACTGTG AA 2460 
GATACACAGA GTCAATAAAT GTGCACT GTA AT GG 

Seq ID NO: 34 Protein sequence:
Protein Accession #: NP_060725.1

1 11 21 31 41 51
| | | | | |

MVAPVLETSH VFCCPNRVRG VLhTWSSGPRG LLAFGTSCSV VLYDPUCRV V VTNU*GHTAR 60 
VNOQWICXQ DGSPSTELVS GGSDNQVIHW EIEDNQLLKA VHLQGHEGPV YA VHAVYQRR 1 20 
TSDPALCTLI VSAAADSA VR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTD VPILAC 1 80 
GNDDCRIHIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLYI 240 
KSTSLETQDD DNIRLKENTF TIENESVKIA FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMHAHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFIITVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE lARPQIHGYD UCCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYG HG YEI FCVTCNSSKT LLASACKAAK KEHA AHLWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNEICFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKITSVHSR UWSCDWSPD SKYFFTGSRD KKWVWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHFSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AHUOCWKNC SGKTEQKEAE G AEWLHFASC GEDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM_022131

Coding sequence: 11-2878 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

TGCTGOGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG COGGCAGCGC CGCCTOCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 180 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCmTGC 240 
AGGGGAAATC TGTGCGTTCA AG ATCCATGG CCAGGAGCTG CCCITTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAG AGG GCCGGCTCCG TGCCAAGAGC OCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TG ACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AA AG AGCC AG CCTAC A AG GC TGTTGTGACG G AGGGCA AG A TCTATGAC AG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGG A CTGCTCCCCA CAGTACAGCC AG ATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCG AC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCA AGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAG AT 900 
CGTCACAGAG CTGCAG ACTA ATTACAriGG GAAGGGTTGT GACCGGGAGA CCTACTCTG A 960 
G AAATCCCTT CAGAAGTTAT GTGG AGCCTC CTCTGGCATC ATTG ACCTCT TGCCATCCCC 1 020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAG A TG ATCTTCAA 1 080 
OTTTGAOGGC AGGCAGGGTG CCAAAATCCC CG ATGGGATT GTGCCCAAGA ACCTGACCG A 1 140 
TCAGTTCACC ATCACCATGT GG ATG A AACA CGGCCCCAGC CCTGGTGTG A GAGCCGAGA A 1200 
GGAAACCATC CTCTGC AACT CAG AC AA AAC CG AAATGAAC CGGCATCACT ATGCCCTGTA 1 260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCOCGCG GAGTTCC ACT GGA AGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGG AGTTTC CTGTGGTA AC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCA ACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1 500 
CGGCGCTTGT TGGCAAGG AG GAGA AGTCAC CAAACCACAG TTTGCTCAGT TCTITCATGG 1 560 
AAGOCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TG ATCTCCTG 1 620 
CCTGCAGGCC TGCA AGG A AG GGCTGG ACAT TA ATTCCTTG G AAAGCCTTG GCCA AGG A AT 1 680 
AAAGTATCACTTCAACCCCTCGCAGTCCAT CCTGGTGATG GAAOGTGACG ACATTGGGAA 1740 
CATTA ACCGT GCTCTCCAG A A AGTCTCCTA CATCAACTCC AGGCAGTTCC CA ACGGCGGG 1 800 
TGTGCGGCGC CTCAAAGTAT CCTCC A AAGT CC AGTOCTTT GGGGA AG ACG TATGCATCAG 1 860 
TATCCCTGAG GTAG ATGCCT ATGTG ATGGT CCTCCAGGCC ATCG AGCCCC GG ATCACCCT 1 920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTr GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT GATATCAAG a TTGTG AGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCA ACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATG AGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTTCATGAAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCAATCATC TG ATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCXX5GAT 2580
CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCXTAAG GAATCTGAGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGGA GCCAGGCAAG CCX:AGCTGGA GTGGGATGAC TCCACXCTCC CCTACTAGTG 2880 
CCCAGGGGTCTGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGAC(X AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTG ATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCITTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 
CTTCTGCrCT AAGTICCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CnTGTmr 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CrGTGCX3TCT CCTCCACACA G ACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGA ACT TCACACACGT AAGGTCITAG 3360 
TGCTTAACAG TTTAAAGGAA AGTCCITGTT G AGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTIT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCX:A CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA OTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAATGAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACTCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCTTATCTCCTATCTTCCXn'CTTGAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTG AAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTCCATCCTAACA TGCACAACCT GTGAAGAGAA TTGITTCTAT AGTAACTGGT 4200 
CTGTGATCrr TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATITCC 4260 
ACG AAGCTCT GAG AACATGT TTGnTCGAA TGTCTGATTC CTCnTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGnTG GCACAGGCAC 4380 
AGAGCTGGTCCCTAGTTAAGTGGCATTTATGTTAAAAAAA AATAGTTCAG AATCTCAGCX: 4440 
TTTTCTTTGT GTCATCAAA A CAGCTTAAG A AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGG AGGACX: CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 
CTTGTGA AGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCC TTC C TTTGGACCT 4620 
CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA G ATACATTTT TGTGATGAGT 4680 
AAACTGAGGCTTCG 

Seq ID NO: 36 Protein sequence:
Protein Accession #: NP_071414.1

1 11 21 31 41 51
| | | | | |

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG VITENNDTVI 60 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAVVLNKT SGEGRLRAKS PIDCELQKEY 120 
TFnQAYDCG AGPHETA WKK SHKAWHIQV KD VNEFAPTF ICEPA YKA VVT EGKIYDSILQ 1 80 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKJLDQIC DICEWHYYVIN VEFPWTLYM DGATYEPYLV 480 
TNDWPIHPSH lAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 
CXEGLDINSL ESLGQGDCYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GEDVCISIPE VDAYVMVLQA lEPRTn-RGT DHFWRP AAQF ESARGVTLFP 660 
DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 
HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSVVP SIATVVinS 840 
VCMLVFVVAM G VYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEOEEEEEAE EEMSSSSGSD DSEEEEEEEG MORGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DNA sequence

Nucleic Acid Accession #: none found

Coding sequence: 143-874 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 
ATTG AATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AG ATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTA AAATGCA GAAATGCATG 600 
CTGTCAGCGT IGGTATrTCA CATTCAATGG AGCTGAATOT TCAGGACCTC TTCCCATTGA 660 
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720
CACTTCTTCT GTGGA AGGAC TTTGTG AAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 
1TCTCGCATC ATTATTG AAG AACTACCAAA ATAAATGCTT TAATTITCAT TTGCTACCTC 900 
T lirnT ATT ATGCCTTGG A ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACA CTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTnTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1 140 
GGTCTTTTGT 1 1 1 nC TCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACTAATCTT 1 200 
TGTACAAnr GTAAATGTTA AGAA llTllT TTATATCTGT TAAATAAAAA TTATTTCX:aA 1260 
CAACXHTAAA AAAAAAAAAA AAAA 

Seq ID NO: 38 Protein sequence:
Protein Accession #: none found

1 11 21 31 41 51
| | | | | |

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFIX MRSN SALRVL FSGSLRIXCR NACCQRWYFT FNG AECSGPL PIEAIIYLDQ 1 80 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 
LPK 

Seq ID NO: 39 DNA sequence

Nucleic Acid Accession #: NM_000949

Coding sequence: 285-2153 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCOCTCnT CTGG ATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 1 20 
ATGTCCTCXyr GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 1 80 
TAAAG AACrC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTrCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG G AA AATGTGG 300 
CATCTGCAAC CGTTTTCACT CTG CTACTTT TTCTC AACAC CTGCCTTCTG AATGG ACAGT 360 
TACCTCCTGG AAA ACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATG AATGTC CAG ACTACAT AACCGGTGGC CCCAACTCXT 540 
GCCACnTGG CAAGCAGTAC ACCTCCATGT GG AGGACATA CATCATGATG GTCAATGCTA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGOCAGACCC TCCTTTGG AG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
AOCTGTGGAT TAAATGGTCT CCACCTA<XC TG ATTGACTT AAAAACTGGT TGGTTCACGC 780 
TOCTGTATGA AATTCGATTA AAACCCGAGA A AGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AG AGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGA(XAT GG ATACTGGA GTGCATGG AG TCCAGCXjACC TTCATTCAG A 960 
TACCTAGTG A CTTCACCATG AATG ATACAA CCGTGTGG AT CTCTGTGGCT GTCCTTTCTG 1 020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TA AAAGG ATT TGATGCTCAT CTGTTGG AG A 1140 
AGGGCAAGTC TGAAGAACTA CTG AGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1 200 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1 260 
TGTCAGTCCA TTCAAA AG AA CACCCAAGTC AAGGTATGAA AOCCACATAC CTGG ATCCTG 1 3 20 
ACACTG ACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCrGA A AAGTGTG AGO 1 380 
AACOCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1 440 
CTGAAACAAC OCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGC AAA ATCCCCTATT 1 500 
TTCATGCTGG TGG ATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAAOC 1 560 
OCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1 620 
CACCGGCCAC TCTGTTG AAT G AAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1 680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AG AGGGAGGT AG AAAGCTTC CATTCTGAG A 1 740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGG AGAAAAC CCCCTTTGGC TCCGCTAAAC 1 800 
COTGGATTA TGTGGAG ATT CACAAGGTCA ACA AAGATGG TGCATTATCA TTGCTACCAA 1 860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGACTCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGG ATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1 980 
CTAAAAACGT GGCTTGCTTT G AAG A ATCAG CCAAAGAGGC OCCACCATCA CTTGAACAG A 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GG ATTACCTG G ATCCCGCAT GTTTTACACA CTCCTTTCAC IGATAGCTTG 2160 
ACTAATGGAA TG ATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TnTTAACCA CTIGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTITCCT 2340 
AACTGTGATTTGTAGATTTA CTTTTTGCTG TTAGTTATAAAACTATGTGTTCAATGAAAT 2400 
AAAAGCACAC TGCTTAGTAT TCTTG AGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCITTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATOACAA AAG AAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTCCATAA CCTTT ACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATAACCATTT 2640 
AAAGAAAGAA TGCATTCCAG AACA AATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCA AGTAAG A AAT 



Seq ID NO: 40 Protein sequence:
Protein Accession #: NP_000940.1

1 11 21 31 41 51
| | | | | |

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTUDLK TGWFTLLYEI RUCPEKAAEW 1 80 
EIHFAGQQTE FKJLSLHPGQ KYLVQVRCKP DHOYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLSAVICL nVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPOaSME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence

Nucleic Acid Accession #: none found, Eos cloned sequence

Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTG A AGTGCCTG AT 60 
GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 120 
ATCATCAAG A CATTTGOTGT CTTCITTAAT GACTTAATGG ACAGTTTTAA TGAATOCAAT 180 
AGCAGGATCT CATGG ATAAT CTCAATCTGT GTGnTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TG ATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GG ATGGTGGC CGCCTOCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTITC TCCCAACTGT AACCATCCTA 420 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCX3AAAATAG TCATCCAGG A AAATCGGAAA 660 
G AAGCGCAGT ATATGCTTGA AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
GGAGTAG AAC TAACTACCTC ACXTTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGG AG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAGACTTCTC C ATTTTG AAA GAGAA AAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT rmTATTAT CTACGATGGC CATTGCAGAA 1020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC nTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TmTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACrCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
GGTTTGTTGG TGGACCA AAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAQAGTTCA CGTGCAAATG 1560 
GAGCCGGTA T GA 

Seq ID NO: 42 Protein sequence:

Protein Accession #: none found, Eos cloned sequence

1 11 21 31 41 51
| | | | | |

MTQNKLKJLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRISWnSIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGnSGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAl MALKERIGWR 180 
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSHCY ALF 300 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIG AGFV LNREPIRKIY 360 
lEUCVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DVVGIEKMSS 420 
AAGVYIHQS lAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKW SHRGKTLQDI PEOFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGC TGTCTG GCTTCTTG AT G AGTCCCAGT ACCCAGCAC A G AGCACAGTA CACTCCCGGA 60 
GG AAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 1 20 
G ACCGGGAGA GGGAGAGCCG GCOGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 1 80 
GGGGAGGCGG AGAAGGGG AA CCGGGGCGAG CCGCOCGCCT GGATCCGCGC CCAGCAGCAC 240 
CCGCGGCCGC CGOCAGCTGG GCAGGCTCCC GGG ACTGCGG CTGGGGGCGC GCAGGACCCT 300 
CGCCTGCGTC CTGG ACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCXjAGTGCA 420
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGG AC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGOCGCTCG TCCGTCTCCT CATCCTGG AA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGG ACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCG ACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1 020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TCCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTICTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC COCCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CdTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAOTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCA ACTGTT GCTTGCCGG A AGGGACCATG 1 500 
AACAACGTGT ATGTCAACAT GCCCACG AAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1 560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AAC AGTG AAC AG AAG ATGTA CCCAGCGGTG 1 740 
ACTGTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTG AA GGGAG ACGAA GGCAGGGGTG 1 800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTAnTA TGGCAOGATT CCTTTGGATG 1860 
GCrrCATTTG CCCCCAG ACT GTATGAAAAC ATCTCCGA AT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGGA A AACCGGCCTC AGCTGGAGAT GACTGTG ATG 1 980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCG AAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGG AT TTTTATTATG TTGTTATTAT TATTTCmT 2100 
TTGTTGT TG A CTGCAC AGGA TCAAAATGCC TGTTATCTCC CmTACTGG GACmTTTT 2160 
in in 1 1 1 1 lllin ITAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGG AGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCA ACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCC AC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTIT TTGTATTTTT 2340 
TGTAGAG ATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTG A CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCT CCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGeCTTTT TTnTITCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCC ACA 2580 
GGCACACCTT AATTTCATTG TAAAAAG ATA TATATATTTT GTCTATmT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCmTr TACCTTATGT AGAG ATCTTA TTACAAAOTG ATTTTCTACA 2700 
TTAAAA AGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAG ACAT TTCAGAACIC 2760 
TGGGATGATTTTAATCrrGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCOCCTTC 2820 
TTGATTGTATCTTAATnTCTGGCTTTAAG GTGACATCTG AGAGGTAATG CATT Cl t ' TlT 2880 
TATA1TGA AA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CnCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTG ATAA 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAA AGAAATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCA TAAC CT G AGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCA AG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATrAA ACAAATCGCT 3480 
GTATGGT ATG GTCTTCTACA CATTTATGTC TATAGATATC T ATCGATCAT CITTCTATTC 3540 
TGTTTC ATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 
TTTTTTAATA ACCAAAGGCA GGGGAA A ATC ATTTT ACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAA AAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 
| | | | | |

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 
SGRQPRGPSD QPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 
GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV

Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_002285

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCGACGC TGACACCTCX: CACCATGGAC fiO 
AGCrrCGACT TAGCCCTGCT CCAGGAATGG G ACCTCX5AGT CACTGTGTGT CTATG AACCA 1 20 
GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTG AGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGG AT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA G AACCAATCA G AGTCATCTC GTTGGAGTTC CCAAACXTGG GGTTCCTCAG 360 
ACrOCTGTGA ACAAG ATCGA TGA ACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 
TCXmCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCXjTGCA GCAGAGTA AA 480
AGAGGCACTA TGGGCTGGCA G AAGGCTGGG CACCCAOCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG G ACCITGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
GOCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCX: AGGAGAGGCC ACCTGCCATG 660 
GCXK3CCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 
AAACXX:AGCC TGGTCCAGCA GAAACXGACC GCGTATGTGA GGCCAATGGA CGGCCA AG at 780
CAGGCCCCTG ATGAGTCTCC TA AGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAGAAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 
AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080 
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1140 
ACATCAATGC TGGAAGATG A CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAA(XT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 
AGCTCCTCCA GCGACTCAG a GAGCAGCTCC GGATCTGACT CXjGAGACCGA GAGCAGCTCC 1380 
AGCXSAGAGTG AGGGCAGCAA GCCCCCXCAC TTCTCCAGOC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGG a TAA ATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTG ATCCAAA ATG AAAGCC a CGGGTCAG AG AGCAATCAGT ACTACAACCC GGTG A AAG AG 1 560 
GAOjTCCAGG ACTGTGGG a a AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAG AAGG AG 1 620 

ATCAAGAGCA CTTGCAAGG A GGAGCAAAGG OCAAGGACAG CCAACAAGGC CCCTGGG AGT 1680
AAAGGOGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACXrCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCXjGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 

CACCGGCCCG AGG AGCCCGC GGCCGCGG AC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920

GAGCXXACCA a AACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCXiC 1980
TCCTCCXjTGA CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGG AG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGAOCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTGAA GG AGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220

GTAGGCTCX:A TCAA CGCC AG GACCACCAGT G ACATCXjCCA AGG AGCTGG a GGAGCAGTTC 2280
TACACACTGG TCCCCTTTGG CCGG AACGAA CTTCTCTCCC CTCTAAAGGA CAGTG ATG AG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGTCCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAG AAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 

TGTG ACAACG AAG ACG acta CAGGG AG ATC a AGA AGTCCC AGGGAG AG a a AG AC AGCTCT 2580
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT GAAAAAATGCTTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACTTAACTTCTTC CAGCCGACCT 2760 
AATGGCAAC A GTTTGTTTAC TTC AGCCTCT TCCAGCAAAA AGCCTAAGGC CG ACAGCCAG 2820 

CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCrCACA ACAATTCTGA AAACATTCCC 2880
CTCCACAAGTCACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAG AGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAACGAATGAAGCATAAAGCAGATGCAATGGTGG AAAAGTTTGGAAAGGCTTTG 3060 

AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGG A ACAAGGCCCC 3120
ATGGAATCC A A ATCTCCTTA TTACCTG ATG T ATTCAG A AA CAGTAGAGCT CATCAGGTAT 3180

GCTATGAG AC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGG AC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC G ACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCAT CTCCGTGGG G GGCCAGTGG a AAGAGCACTG G AACCCCATC CCCCATTTCT 3420 
CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480
CTGTCCCCGTCGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACG AC CTGG ATCTGC TCATGGGGCC GGTCACCCTG 3660 

CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720
AGCGCCCACC TGTCA TAGGG ACCTCACCCT GGGGOCAG AG TGGGCTCTGG TCTCCACAGA 3780

TGGCTCAACG TnTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAG ACTG 3840 

CGGTG aacat ttcctca 

Seq ID NO: 46 Protein sequence:
Protein Accession #: NP_002276

1 11 21 31 41 51
| | | | | |

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
KGDELSNRIQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120
QPSSICSTTT STPAAVPVQQ SKRGTMGWQK AGHPPSDGQQ RATQQGSLRT LLGDGVGRQQ 180
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240
QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300
CVEEIIREMT WLPPLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAVVQQ PNCRTSVPSS KGSSSSSSSG 420
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480
PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEIKSTCKEE QRPRTANKAP 540
GSKGVKQKSP PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600
NCHRPEEPAA ADALGTSVVV PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRIVP 660
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EIKKSQGEKD 840
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080
RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1140
SALSPSTIVS IPQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: NM_033151

Coding sequence: 351-4499 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ACTGGGATAA AGCAAGAAG A CTGATTTTAT GAGCAGGGGT TTG ATACATC AAAGGAG ATT 60 
GCCCAGGATC AAGGGTGCGG TQTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTG AT TTGGGCXTG A GGCCTGAG AA G ATGTTTAA A A AG AGGG ATC AAGCACAGGC 1 80 
TAAGG AG AGG A A AGAGCAGG C ACCCAAACC TCTGCATGGC CCCAATATG C T(XCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GG AACTGAAA ATGACTAGGA 360 
AG AGG ACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGG A CTTATTTATA AAACCTATAC TCTCCAAG AT GGCCCCTGGA 480 
GTCAGCAAGA G AGAAATCCT GAGGCTCXIAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CA ATGCTGGC CTGTTCTCCT ACCTCACCXjT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGG AGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACX:GCX:TTTG GGAAGAAG AA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCITCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGGGC ATCTGCITCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACrCTGCrr TGCCCrmTCTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CITCCTACTT C ATTATTGGA TACACTGCAT 1 260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCrAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGrrCTTTG TGCCTATTGC AGTCAAAGGT CT CACG AATT 1680 
CCAAGTCTGC AGTGATGAGQ TTCAAGAAGTTTTTCCTCrA GGAGAGCCTT GTITrCTATG 1740 
TCCAG ACATT AC A AG ACCXTC AGCAAAGCTC TGGTCTTTG A GG AGGCCACC TTOTC ATGGC 1 800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAG A GGAAGAAGGG AACAGCCrGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGG AAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGG A ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
TGGACGCCCA CGTGGGGAAG CACAnTTTG AGGAGTGCAT T AAGA AGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA AmTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAA ATC TGTG AAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGG AC ATGTTGCAGG 2580 
ACACAGCAAA G ATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCITG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGG AGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CXXTTGCACAA CAA GCTCT TC AACAAGGTIT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCi'i riG AACTGCTTCG 3120 

CAGGGGACTT GGAACAGCTG G ACCAGCTCT TGCXTCATCTT TTCAG AGCAG TTCCTGGTCX: 3 1 80 
TGTCCTTAATGGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCrCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTAnTGCT TCATTTATTA TATGATQTTC A AG AAGGCCA 3300 
TCGGTGTGTTCAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCAAGGCXrrGAGCTCCATCCATGTCTATGGAAAAACTGAAGACTTCATCAGCC 3420
AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTrCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCrrr TGGCATTTCC TCCACTCCCT ACrcCTTTAA AGTCATGGCT GTCAAC ATCG 3600 
TGCTGCAGCr GGCGTCCAGC TTCCAGGCC A CTGCCCGGAT TGGCTTGG AG ACAGAGGCAC 3660 

AGTTCACGGC TGTAGAG AGG ATACTGC AGT AC ATG A AG AT GTGTGTCTCG GAAGCTCCTT 3720
TACACATGGA AGGCACAAGT TGTCCCC AGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGC ATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGG AGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 

TGGACATTTG CAGCATCGGC CTGGAGG ACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020
AtCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TOCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCIT ATCGATGAAG 4260 

CCACAGCCTC CATTGACATG GAGACAGACA CCCTGATCCA GCGCACAATC CGTGA AGCCT 4320
TCCAGGGCTO CACCGTOCTC OTCATTOCCC ACCOTOTCAC CACTOTGCTG AACTGTGACC 4380 
ACATCCTGGTTATGGGCAATGGGAAGGTOG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGA TAAG 4500 

GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560
TCGAGGCCCA CAGTCTGCG A CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTG ATGGC 4680 
TCTCAAGACCTTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCrnTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TA AGG AAA AC 4860 

TC

Seq ID NO: 48 Protein sequence:
Protein Accession #: NP_149163.2

MTRKRTYWVPNSSGGLVmOIDIGDDMVSGLIYKTYTLQIXjPWSQQEW^n'EAPGRAAVPPWGKY^
MIPFRPKPRPPAPQPLDNAGLFSYLTVSWLTPLMIQSLRSRLDENTIPPl^VaroASDKKVQRUIR^ 
VSRRGIEKASVLLVMLRFQRTRLIFDALLGICFCIASVLGPIUIPKILEYSEEQLGNVVHGVGLCFALF 
LSECVKSLSFSSSWIINQRTAIRFRAAVSSFAFEKLIQFKSVIHrrSGEAISFFTGDVNYLFEGVCYGPL 
WJTCASLVICSISSYFnGYTAFIAlIXTLLVFPLAVFMTRMAVKAQHHTSEVSIXjmVTSEVLTCIK 

LimYTWEKPFAKIIEDLRRKERKLLEKCGLVQSLTSITLFIIPTVATAVWVLIHTSLK^^

LASI^LLRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVFYVQTLQDPSKALVFEEATLSWQQTCPGI 
VNGALELEIU^GHASEGMTRFRDALGPEEEGNSLGPEUKINLVVSKGMMLGVCGNTG^ 
MHLLEGSVGVQGSIAYVPQQAWlVSGNIRE^^LMGGAYDKARYLQVLHa:SL^mDL^ 
GmLSGGQKQRISU^WSDRQIYLLDDPLSAVDAHVGKHIFEEaKKTLRGKTVVLVTHQLQYLEFCG 

QllLIJENGKICENGTHSELMQKKGKYAQLlQKMHKEATSDMIjQDTAKIAEKPKVESQALATSLEESLNGN
AVPEHQLTQEEEMEEGSLSWRVYHHYIQAAGGYMVSCIIFFFVVUVFLTIFSFWWLSYWLEQGSGTNSS 
RESNGTMADlXjNlADWQLSFYQLVYGLNALLLlCVGVCSSGIFrKVTRKASTAUiNI^ 
FDTIPIGRLU^CFAGDLEQLDQIXPIFSEQFLVLSU^IAVLUVSVLSPYILU^GAIIMVICFIYYMMF 
KKAIGVFKRLENYSRSPU^SfflLNSLQGLSSIHV YGKTEDFISQFKRLTDA<^'l>rV^ 

EIMTNLVTl^ VALFVAFGISSTPYSFKVM AVhnVLQLASSFQATARIGLETEAQFTAVERILQYMK^

EAPLHMEGTSCPQGWPQHGEIIFQDYHMKYRDNTPTVLHGINLTIRGHEVVGIVGRTGSGKSSLGMALFR 

LVEPMAGRILIMVDlCSIGLEDUlSKLSVIPQDPVLl^GTIRFNLDPFDRHTIXJQrWDALERTFLT!^ 

SKFPKKLHTDVVENGGNFSVGERQLLCIARAVLRNSKIILIDEATASmMETDTUQRTmEAFQGC^ 

VIAHRVTTVLNCDHILVMGNGK VVEFDRPEVLRKKPGSLFAALMATATSSLR

Seq ID NO: 49 DNA sequence

Nucleic Acid Accession #: NM_033419

Coding sequence: 18-980 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CGAGCCAGGG AGAAAGGATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCG ACC GTG AGCCGGT GTACCGCG AC TGCGTACTGC 1 20 
AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 1 80 

TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACG ACTG TAAGTATGAG TGTATGTGGG 240
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCrCCCG GTTCCTGTTC TTTCAAG AGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTG ATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGITCTGGTCCACAG 480 

TTTrcCACACCAGGGACACTGACCTCACAGAGAAAATGGACrACrrCTGTGCCTCCACTG 540
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACA ACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 

AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840
CACCGCrCTT CTGGGTCCTG GATGCCC ATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTTTTTCAGCTTTCTGGAAGATGACAGCCTGTACCrGCTGAAGGAATCAGAGG 960 
ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGG A CATGAAGGAT GTGGGCTCAG AATCATGTGG CCAGCXCACC CCCTGTTGGC 1 140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGG A CTCGAGAGTG 1200
GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260
CTGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
ACCAAGCTGC TGGG ATTGGG A AGGAGTTTC ACOCTGACCA TTGCCCTAGC CAGGTTCCCA 1560
GGAGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCXTCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 
GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAG A 1860
TGAGTG ACTG CCGGTGAATG TGTCCACAGT TGAGACGTTG GAGCAGGATG AGGGAATCCT 1920 
GTC ACC ATCA ATA ATCACTT GTGG AGCGCC AGCTCTGCCC A AGGCGCCAC CTGGGCGGAC 1 980 
AGCCAGGAGC TCTCC ATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTrCTG GTGCCCCTTT 2040 
GCCCGCCTCC TGCA A ACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 
TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCrCT CTCCATCCTC CmTCTCCT 2160
TGCCTTCGCA TGGCTGGCCT TCCCCTCCA A A AOCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG G AGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCAC ACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 
AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACITTCCAT 2460
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TG ACCCCTTG 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTG ATACT GAAAACmT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AA AAAAAAAA AAAAAAAAAA AA 






Seq ID NO: 50 Protein sequence: 
Protein Accession #: NP_219487.1

1 11 21 31 41 51
| | | | | |

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 180
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 



Seq ID NO: 51 DNA sequence 
Nucleic Acid Accession #: XM_059098.1

Coding sequence: 178-618 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GATGTACACT CTG AAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 

GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAGATG 180
TCGTACACCA CAATGTGCAT CAAGGAGACG TGOCGATTGA TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACXTCTG CTGTCTGGAA AA ACCCAAAG 360 
GTCTTTG ACC CCTTG AGGTT CTCTCAGGAG AATTCTGATC AGAG ACACCC CTATGCCTAC 420 

TTACCATTCr CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480
AAG GTAAC CA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GG ATGTATrT GCACCTGAAG 600 
AAACTCTCTG AATGTIAGAT CTCAGGGTAC AATGATTAAA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTA ATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 

GAGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCAC TACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AGAATTATGC A AGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 

TGC CCTGA CC AATCCTACTG CmTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080
GACTl Mice TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTGAT TmTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence:
Protein Accession #: XP_059098.1

1 11 21 31 41 51
| | | | | |

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITVVLSIWGL HHNPAVWKNP 60
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN CIGQEFAMIE LKVTIALILL HFRVTPDPTR 120
PLTFPNHFIL KPKNGMYLHL KKLSEC 

Seq ID NO: 53 DNA sequence

Nucleic Acid Accession #: NM_030916

Coding sequence: 1-1533 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGCCrCTGT CCXrTGGGAGC CGAGATGTGG GGGCCTG AGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 180 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT OCAGGCGCGG 420 
CTGCGGCrCC GAGTGCTGGT GCXnXXCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
G AGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCXTC AGCCCCX:aGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTrCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGGATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780 
ATTGGCAG AG AAGGAGCTAT GCTCA AGTGC CTG AGTG A AG GGCAGCCCCC TCCCTCATAC 840 
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 
TTCTGCX:TTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCXjGCG CAAGGCOCAG 1 140 
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAG ATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 
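
The coding-sequence range given with a DNA sequence in this listing (1-1533 for Seq ID NO: 53) is 1-based and inclusive, and its last codon is the stop codon. The following is a minimal Python sketch of how such a range relates to the length of the paired protein sequence. The function names and the toy input are illustrative assumptions and are not part of the listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Keep only A/C/G/T, dropping spaces and end-of-line position counters.

    Assumption: the input has no OCR damage. A misread base would be
    dropped here and would shift every later coordinate.
    """
    return "".join(ch for ch in raw.upper() if ch in "ACGT")


def extract_cds(sequence: str, start: int, end: int) -> str:
    """Return the coding region for 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


def expected_protein_length(start: int, end: int) -> int:
    """Number of encoded residues, not counting the stop codon."""
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError("coding span is not a whole number of codons")
    return span // 3 - 1


def has_start_and_stop(cds: str) -> bool:
    """True if the region begins with ATG and ends with a stop codon."""
    return cds.startswith("ATG") and cds[-3:] in STOP_CODONS


# Toy input (hypothetical, not taken from the listing).
toy = clean_sequence("CCATGGCTTG ATT 13")
toy_cds = extract_cds(toy, 3, 11)
print(toy_cds, has_start_and_stop(toy_cds))  # ATGGCTTGA True
print(expected_protein_length(1, 1533))      # 510
```

For the range 1-1533 this gives 510 residues, which agrees with the position numbering of the protein listed as Seq ID NO: 54.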

Seq ID NO: 54 Protein sequence: 
Protein Accession #: NP_112178.1

1 11 21 31 41 51
| | | | | |

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360
FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480
AMNHFVQENG TLRAKPTGNG IYINGRGHLV

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF007170.1

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCi^CTC 120 
AGCTACCTCA AGCCCAGA AC CAAGGA AAGC ATGTACCACT CACTGACATA TGCCACCATC 1 80 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATG AAGGAGG CACAGATGCT GTGTCAG AGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
G AGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGG AGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 
AGGATCCTGA GGCTCjTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTG A AC CGGTACCCTA AGGGT<X:CAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GG(jCCAGTGG AAGATC3TCCT ACTTCTACGC CGACCTGCTC 1 020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCXjAGCT 1 140 
GTGCCAGGCC TGAA<5CTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC (KKXJCTACTr CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
TCAGTGGATG ACGAGTGCTT GGTGAA ATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAG ATTAA A 1 500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGG AGCAA 1 560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGG ACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1 680 
CTAGAGAACA GCAGCAG ATC CATGGTCTCA TCAGTGTCCT T GTAGC TTTG TGCAGCAGTT 1 740 
CCGGGCTGGA AGACAG AGAC AGCTGG ACAG AGCTCCTG AA AACATTTCA A A ATACCCCCT 1 800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1 920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTG ATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACA ACAATC AACCACCTGT AAGTCnTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTT AT ATG A TG ATGCCAAA GGGCAAATTG CTTTTCAAAT 2 1 60 
TCAGCAAGTTCTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGOCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAA AAATC 2340 
AAGCACGTCTTTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGG AAGAG CTGTGTAGCC CmCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protein sequence:
Protein Accession #: AAC39582.1

1 11 21 31 41 51
| | | | | |

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120
FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM_006670.1

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGGTACTCCTGGGCTGGGTCTCCTCGTCTTCTCCCACXnx:CTCGG^ 120 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACXTTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACC A CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCA ACTGCCC 3 60 
AGCCTCAGGC ACCTGGACTT AAGTA ATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTG AGTT GCA AGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAG AGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTG A CCTGG ACTGT G ACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAA AAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 58 Protein sequence:
Protein Accession #: NP_006661.1

1 11 21 31 41 51
| | | | | |

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240
SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300
SSNSDVLE 

Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession #: NM_024022

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGGGGGAAA ATG ATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC GCn iTTGGC 60 
CTTGATGATT TG AA A ATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTG AA GlTl'l TICCA ATCATCGTC A TTGGG ATC AT TGCATTG ATA 1 80 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCG ACTGCT CAGGG AAGTA CAGATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300 
GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360 
ACAGCTGCTT CGTGG AAG AC CATGTGCTCC GATGACTGGA AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTG AGCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATG ACAAG 540 
GTG ACTGCAT TACACCACTC AGTATATGTG AGGG AGGGAT GTGCCTCTGG CCACGTGGTT 600 
ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTG1TTAT 780 
GACTTGTACC TCCCCA AGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGG ACAAT 840 
CCAGCCCCAT CCCACTTGGT GGAG AAGATT GTCTACC ACA GCA AGTAC AA GCCAAAGAGG 900 
CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TG AAATGATC 960 
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTG A TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 
GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260 
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 1320 
GACTGGATCC ACG AGCAGAT GGAGAGAGAC CTAAAAAC CT GA 

Seq ID NO: 60 Protein sequence
Protein Accession #: NP_076927

1 11 21 31 41 51

| | | | | |

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT

Seq ID NO: 61 DNA sequence

Nucleic Acid Accession #: NM_006475

Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60
TTGCTGCTTA TTGTTA ACCC TATAAACGCC AACA ATCATT ATGACAAGAT CTT GGCT CAT 120 
AGTCGTATCA GGGGTCGGG A CCAAGGCCCA AATGTCTGTG CCCTTC AACA G ATTTTGGGC 1 80 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGC ATCGT GGGAGCCACC 360 
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AG ATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAGAGAATGT TG ACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720 
CAA ATTGGTA CCTCA ATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCrcCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTFCCACG AG GTGTCCTAGA AAGGTTCATG 900 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAA ATAC TCTCCAGTGT 960 
TCTGAGTCTA TTATGGGAGG AGCAGTCnT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GG ATGTG ACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTG ACAAAT A ATGGTGTG AT CCATTTGATT G ATC AGGTCC TAATTCCTG A TTCTGCCAAA 1 1 40 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGC AT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 
GCATTTTCTG ATG AT ACTCT C AGCATGGTT CAGCGCCTCC TTAA ATTAAT TCTGCAG AAT 1 320 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAG A AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 
ACATTATTTG TGCCAACCAA TCATGCUTl AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800
AAAATCTTTCTGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA I860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATCTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTGTTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGTTTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 21 60 
AGACTGATTA AAGAAGGTGA AACAATAACT G AAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGG AAATAA CTG AAAAAGA GACACGAGAA 2280 
GAACG AATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AA AGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTGAAA ATCCAAA AAC CAGAAAAAAA TGTTTATACA ACCCTA AGTC 2580 
AATAACCTGACCrTAGAAAATTGTGAGAGCCAAGTTGACTTCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGCAATCATCAAATAATTCTGAA CACAAATTTA AT A TTTI H T TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AGAAAAATCCTTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
CCATTG AAAA GACCG AGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGITATA AAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAA ATTATG TTATTTTTTA 3060 
CAACTAATTTTGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATT'l' 1 IT AA 3120 
TCrCAAACGT TTCAATAAAA CCATTITICA GATATAAAGA GAATTACTTC AAATTGAGTA 31 80 
ATTCAGAAAA ACTCAAGATT TAAGTTA AAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein sequence
Protein Accession #: NP_006466

1 11 21 31 41 51

| | | | | |

MIPFLPMFSL LLLUVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRVSDAS 120 
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDUC 1 80 
NGMIIPSMYN NLG LFINHYP NG WTVNCAR IIHGNQIATN G WHVIDRVL TQIGTSIQDF 240 
lEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 
MKYHILNTLQ CSESSMGCAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNG VIHL 360 
IDQVUPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVQEN SCMEKGSKQG 480 
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE ILIRDKNALQ NIlLYHLTPG VFIGKGFEPG VTNILKTrQG SKIFLKEVND 600 
TLLVNELKSKESDIMTTNGVIHWDFaLVPADTPVGNDQLl£ILNKlJKYIQIKFVRGST 660 
FKEIPVTVYT TKIITKVVEP KKVIEGSLQ PIDCTEGPTL TKVKIEGEPE FRUKEGETI 720 
TEVIHGEPU KKYTKUDGV PVEITEKETR EERUTGPEI KYTRISTGGG ETEETLKKLL 780 
QEEVTKVTKF lEGGDGHLFE DEEKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence

Nucleic Acid Accession #: NM_020974

Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GGCGTCCGCG CACACCTCXrC CGCGCCGCCG CCGCCACCGC CCGCACTOCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 
CGGTGCTGCr GCTGCTGCTG CTGCTGOCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTl'l ITCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAG ACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG rTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCAITT CTGCAAAAAC ATCGTGGGCA GnTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGG ACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCITGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1 380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCOCTGCAC TGCGGTAAGA GTGGTGG AGG AGACGGGTGC TTCCTCAGAT 1500 
GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCOC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TG AGTTTGAG CTTG AAACTA ACCAAAAGG A GGTGACAGCT TCTTGTGACC 1 800 
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1 860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCrCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGTTGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGG AAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACIT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TG AATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG G ACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 
TCTTCCTGCC CATAG AGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CXTAATTCTGT GACAACATAT G AAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GA AGCTGTGG ATTCAGTTCA AGTCCAATG A AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAG ATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 
TCAAGGCTCT GTTTG ATGTC CTGGCCC ATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AG ATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTG AG ACC TTACA A ATGA CTCAGCCC AC GTGCCACTCA ATACA AATGT TCTGCTATAG 3 1 20 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3 180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
GAACTTGGTTTTTCTTTCCC AGCATCGTGG ATGTAGACTG AG AATGGCTT TG AGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGG ATGTCTT GG ATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCTAC AG AATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTG AG ACCTGGGAGG 3600 
ACTCAGTrrC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGG AACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AA AAAAAAAA GCAGAAAG AA TTAGAAATA A ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

Seq ID NO: 64 Protein sequence
Protein Accession #: NP_066025.1

1 11 21 31 41 51

| | | | | |

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECXXICEGFFL SDNQHTCIHR SEEGLSCMNK 1 80 
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDOLTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPhfYPGNYP ANTECTWTIN 840 
PPPKRRILI V VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 
IQFKSNEGNS ARGFQVPYVT YDEDYQEUE DIVRDGRLYA SENHQEILKD KKLKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence

Nucleic Acid Accession #: NM_007210

Coding sequence: 1-1869 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATG AGGCTCC TCCGCAGACG CCACATGCCC CTGCGCCrGG CCATGGTG GG CTGCGCCTTT 60 
GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA G AGAGGAGGC CACAGAGAAG 120 
CCGTGGCTGA AGTCCCTGGT GAGCCGG AAG GATCACGTCC TGGACCTCAT GCTGGAGGCC 1 80 
ATG AACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAGA AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TG AACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
CAGAAG AGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600
CTACACACCA CXCCTGCXAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGG AGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGA AACAC TTCCTCCACA TGAGAAGCAG 1 020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 
ATCCCCAAGT CGTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1 140 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAG AGCC CCCACACCTT CCCCAAGGGC 1 260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 
GACATTTCGG AACGACTGCA GCTG AGGGAA CAACTGCACT GTCACAACTT TTCXTGGTAC 1440 
CTGCACA ATG TCTACCCAG A GATGTTTGTT CCTG ACCTG A CGCCC ACCTT CTATGGTGCC 1 500 
ATC AAG A ACC TCGGCACCAA CCAATGCCTG G ATGTGGGTG AGAACAACCG CGGGGGG AAG 1 560 
CCCCTCATCA TGTACTCCTG CC ACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1 620 
CAGAGGG ACC TTCGCCACAA C ATCGCAAAG CAGCTGTGTC TACATGTCAG C AAGGGTGCT 1 680 
CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGG ATCA GCTCATCAGG A ACTCAGGAT CTGGTACCTG CCTG ACATCC 1 800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC A ATCCCAGTG ACCCCCATCA GTTGTGGCTC 1 860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 
AACCCACTGG GTGGCAAGGC AGG ACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG GAGAATGGCT TG AGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 
AACATAGTGAGGATCCCATCTCTACGCCCACCCTCCCCCCGGCAAAAAAAAAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGG AGG TTGAAGCTAC AGTGAGCCIT GATTGTGTCA CTGCACTCCA GCCTGGCCAA 2460 
CAGGTAAG AC TCTGTCTCAA AAAAAAAACA A AAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGA AATGA TATATTCTGG 2580 
CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG G AGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG G ACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
A 

Seq ID NO: 66 Protein sequence
Protein Accession #: NP_009141

1 11 21 31 41 51

| | | | | |

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 120 
QKSKWTPLET QEKEEGYKKH CFNAF ASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 1 80 
VnVFHNEAW STLLRTVYSV LHTTPAILLK EIILVDDAST EEHLKEKLEQ YVKQLQWRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PLLARIAEDK TVWSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWS LTFG WETLPPHEKQ RRKDETYPIK SPTFAGGLFS 360 
IPKS YFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEII PCS VVGHVFR TKSPHTFPKG 420 
TSVIARNQVR LAEVWMDSYK KIFYRRNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480 
LHNVYPEMFV PDLTPTFYGA IKNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQLIR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence

Nucleic Acid Accession #: NM_014112

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

TTCCTCCGCG AAGGCTCCTT TG ATATTAAT AGTGTTGGTG TCTTG AAACT GACGTAATGC 60 
GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 120 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC iilllillli TTTTAA ACAA AAAACAACAC CCCCTCCCCT 240 
CTOCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 
TCT Ci ri TTT AATAAGGGTG GGGGAGGGAA AGGGGGGCGA GGCAGGAAAG ACCTTTTTCT 360 
CrcCCCCCCG CAATAATCCA AGATCA ACTC TGCAAACAAC AGAAGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480 
GTACAGATC A GCTCTCAAAA TGTCTTCTGT GTCTTCTG AG CGTCTTCTAA G ACAATTGC A 540 
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 
TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAG AAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAGAATTCT CTGCAG ATCA GATGTCAGAA AATACGGATC 780 
AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTGAGCCC TTGAAGTCTC 1020
CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1140
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200
AAGTGCCTTC AGATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320
GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTTC 1440
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1500
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1560
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620
GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920
CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1980
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220
GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGGACTTTGCAGCCCA GAAAAGCACCTTGGAGAAATTACTTATCCG TTTGCTTGTA 2400 
GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAGATG 2520
TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640
GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAGAGAT TTCCCGACAC TACAGGAGAG 2700
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCGAAG 2820
AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880
ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAGAGT TCCAGTGATG 3000
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120
GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3180
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420
CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480
GAACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCTTTAGA GAGGAGGTCA GAAGATCATC 3600
TAACTGAAAG TCACCAGAGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720
TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TAATGACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020
TGAGTCACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320
TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560
GACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620
GTTATTCTTT CTTTTGATGA GTAGGTTACC AAGATTAAAA AGTGAGATAA ATGGTCAATG 4680
AGAAAGAATG GAAGATGGTA AACAATCACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860
AAGCAGTACT GGACTGGCCA TTGGACCATT TGTTCCAAAA COCATAAATT GTTGCCTAAA 4920
TTTATAATGA TCATGAAACC CTAGGCACAG GAGGAGAAAT TGAAGGTCCA GGGGAATGAA 4980
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTGATATTTT 5160
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220
CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTCCTTTT CTAAGCCTCC 5340 
TGTTCGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTG A CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
GTATGTCCTC ATAGAACTTTTAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCrCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGA ACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCr TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT nTTTGCCAC GGAGATGTAG 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTC ATTAA AGAAGTTGGG 5940 
AGAACAAATA GGAAAAAAAA AACTrATm TCTAGTAAAT ATTAATGT AT TACATTTCAA 6000 
ATAATGGTGC CTGACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACA AAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCr CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120 
CAAACAAGGT AATGA AGCAA CTATTTCTAT TGCAGTAG AT ATCCTTTTGT GTGTGTGTGT 61 80 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
GAAAACAAG A AGGAAATGAA AATATnTTA TGCCTACTTA GGAAAAAAAG GGTAGCACIT 6300 
ATTCATTCCA AGTACTTTTr TTTTnTAATTnTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATGATA AACATATATC CTCTmTAT TGCnTGTCT ATGTTTCATA TCAAACATTT 6420 
CAG AAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480 
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTrGTATC ATrGTTTTGT 6540 
TTGGGnTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGITG GATGGCAGGT 6600 
AG AGTTCA AC CAGTTCGTGA CTGTTGTAGC GA ATG AAGTT AAAAA AATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTrCATTT TTGC Al 1 i 11 TTGTTTGCAT ATTAA AAAAA GAGAAAAGAG ^720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 
AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 
GGATGCATCT CAAAATGTAT AGCCAGACIT G AGAGGTGAC AATTAAAGAT CTAAAAAAGA 6960 
GAGGAG ATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAG AATGCA 7020 
TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCTTAAATCAGCA AAACACCAGA 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACmTT GGGCAAAACT ACTC ATTAA G 7140 
CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC AATTTTTTrT TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTC AGTCTA CTTTTT AGTA A AATA TTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTGAAA 7440 
GAGACAAA AA TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATTAAA GCCI TTAT TC 7500 
CATCTCTC A A G ATATATTAT AAAATTTATT TTTTTAATTT AAG ATTTCTG AATTATTTTA 7560 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGAA CTTTTTTTAA TGAGGAATTT 7620 
CATGATG ATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTGATGA AAATGATGTG 7680 
CCAGCTA AAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 
CTAAATTG AG CTTTGGATCA AACTITAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCrCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAG AAA 7860 
AATCATAA AG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAG AGACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGG AC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT G AGATTTTTT TTAATCA ACT TTTAAGATAG 8040 
TGATGTTCTG TTCTAAACTG TTCrGTTTTA GTGAAGGTAG AmTTATAA AACAAGCATG 8100 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGG AAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATG1TTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATACATTT TTCTTA ATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 
CrCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT GTGA ACTGCT 8400 
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CATCCAGTGG 8460 
TGGAAATGTG GAGTAAAGAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTA AGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAG AGCCA TGGCA AAAAA 8580 
AAAAAAAG AA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640 
TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCA A AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAGGCAACT GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTATTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTITTGGTTT TTCAGTACTG 8880 
TTTCATAG AC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 
AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATG AAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAG AAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
GAAAAGCAGG AATAACTTTA OCGTGGAAAT AATG TTTAG C TTTT ATCAG A GAAAATTG TC 9180 
CTTCTAGAGC ATAGAGTCCC AAAACTCAAT TCTGGTTTTC CCCTGnTTT TTTTTTTITT 9240 
TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTA A AAT TGAAAAATAT ATGTATTAAT A ATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGnT GTAAAGCACTTTGAGATCCTTGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGITG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAATAATA A ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
G AAAACTTTG GGGGAGCCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGGA ATGATTTCAT GTTGCTAAGA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 
TCTTACA ACA CATGAAATTT TTTCGTnTA TTTT ATTTTG TTTTCATAGT GCATGTTCAT 9900 
TTCTACTCAC AAACATGTTCTTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTT ACAC A 

Seq ID NO: 68 Protein sequence
Protein Accession #: NP_054831

1 11 21 31 41 51

| | | | | |

MPYEVNAGYD FTNM VRKKNP PLRNVA SEGE GQILEPIGTE SKVSGKNKEF S ADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEEXJKMS PKATEETGQA 1 80 
QSGQANCQOL SPVSVASKNP QVPSDGG VRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240 
GYG YYONDPT DLDCHFRKYH LGLHNRTRQD AELDSKILAL HNM VQFSHSK DFQKVNRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSGGTFl GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELBQHFLQT HPNKIKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKITVKAGD 420 
DTPVGYSVPl KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKLLEHYGK QHQAVQSGGL 480 
NPELNDKLSR GS VINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NM VTSYNCQF 540 
CDFRYSKSHG PDVIV VGPLL RHYQQLHNIH KCTKHCPFC PRGLCSPEKH LGEITYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSEXjQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTAADTQSL 720 
LEHFNTVHCQ EQDITTANGE EDGHAISTIK EEPKIDFRVY NLLTPDSKMG EPVSESWKR 780 
EKLEEKDGUC EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETKGFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSQVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNIIK QNNGEQIIRR 960 
RTRKRLNFEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 
GSLTKSHSAQ QPVLVSQTLD IHKRMQPLHI QDCSPQESTG DPGNSSSVSE GKGSSERGSP 1080 
IEKYMRPAKH PNYSPPGSPI EKYQYPLFGL PFVH>fDFQSE ADWLRFWSKY KLSVPGNPHY 1 140 
LSHVPGLPNP CQ>fYVPYPTF NLPPHFSAVG SDNDIPLDLA OCHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NWKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSOPFQCSI 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQ VEKNG KPKE 

Seq ID NO: 69 DNA sequence

Nucleic Acid Accession #: XM_073879

Coding sequence: 1-387 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGGGGTTTG G AG ACC AGGG AACG GTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
G AAGTG AAAA TGTTTGG AG C C AGTC AAGGT TTGCTG ACAA TGGAAACAAA CCAGTCCXTTG 1 20 
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAG ACACTQ TTCTCTTOOA OAGTTTATAC 1 80 
CACTGCGGCr TTGAACACGG GAGCGTG ATG CACTGCCTTG GGGATOATCA CCCCCAGGAA 240 
GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCG ACTCCT 300 
GTCTGTCCTG CACACCACTC AACAC AG AGC ATCTGCCAGT TTCTACAGCA CTG CAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACI^ 

Seq ID NO: 70 Protein sequence
Protein Accession #: XP_073879

1 11 21 31 41 51
| | | | | |

MGFGDQGTVE GSLGTSKKPP EVKMFG ASQG LLTMETNQSL AQGTGCSWK VDT VLFESLY 60 
HCX3FEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Seq ID NO: 71 DNA sequence 

Nucleic Acid Accession #: AB033064

Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51
| | | | | |

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 
CCTGCCTTGC CCCCCCGC AG GGGGTA AGG A ACTG AGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGG ATAG CTTTCTTATT TTGCCCATGT 1 80 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGG A 240 
GAGTOACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGG(XCCT TTTCCTCTCC 300 
AGAGGG AAAT TACCAGTAAA CTCTTCTA AA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 
GAAACACACT AAAGTGGACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTOG CACATGOAG A AACATCTC1T TTAAAATATC TCCCAATTAC CCTTTTCAC A 4 80 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCA AAGCTC 540 
CCCrCTCTAT G AGCTG AAAG ACTGACCA AC CATGAACACT AGTAG GGGAT GGGG AAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGG AAGCAGO AGTAGCnTT ATCATCTTCC 660 
TCTCGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA G AACTTCTAT 720 
ACCCACGAAC AACATOTG AA GAG AG AG AAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGAT2TG TAOTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT G ACAG ATCGT CGCATCAGGG AAG ACTATCC TCA AAAAGAG 900 
ArnTACGAG CATTG AAGGC CAAATGTTGT GAGGAGGAAC TOG AdTTAG GGCTGTGGTG 960 
ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTOG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGOTA GTCAGAGTAC CAACCCCTTG GQTGCAAAQT 1080 
GATAGTGACA TCACTOTTTT GCGCCATCTA G AGAAGATGG G ATGTCGGTT AATOAACCQA 1 140 
CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200
GGTGTTCCTC TGCCGGATAC TTTCTCTrAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380 
CATGAAGCGC CATACCTGTT CCAGAAGTATGTTAAAGAGT CTCATGGACG GGATGTACGT 1440 
GTCA1TGTCG TGGGAGGCCG TGTGGTTGGC ACCATGITAC GTTOTTCAAC AGATQGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTOTG GGOATGATGT GCTCATTGAG TOAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGG ATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
OCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 
CTTCTACCCT CTQGCrGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1800 
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AQCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 
G ACIISACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCmTCT 2040 
ATTTTTAAAA CCAACTTGCA ATGCTGTTCATGGAGGATGCTCAGGAAGATGAGAGAAAAT 2100 
TAGTAGGATTAGTTGGAGAG AGTGGG AGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280 
AAATACTTTT 1TTTTTTCAT GGTCCXTTTGC TnTGTTTTT GTACAAAAA A AAATGGTTTT 2340 
GCTACAAATA TCCA AGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGTC ATCAGTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATmT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 
GGATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 
CCCAATAAAG GCTTCTTAAT G AGAAAATTG GCATGTTTGC ATGATGAAAT GGAAATGAAC 2640 
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 
TTTAAAATATGTGCATTCACTTOTATTTGTTAGTGTmAGTCTTT^ 2760 
TCTQTTAATG TrG CTri"n li II i M 1 1 1 1 TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTA CCnTl "!' TGTTAACAAG 2880 
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTnTGGGG GCAAATCAAG AGCCTATGAG 3000 
TTCTAAGTAT AAAGCTOAAO TOATrTCOAA TGCCAGCGIT ATATATTTGC ATTTTTCACA 3060 
TTTTACGAGG OAOTATATOT OTATGTGTGT GCACGCATGC ATOTGTATGT GTTTTGCTTT 3120 
TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 
GrmTGACT TTAAAA AACA GATG AGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCnTTTGT GTATATGTAC 3300 
OTTGTTTGTTTTTTTCCnT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATOCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG G ATTTGGG AT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTG AG TCTCTTATAG 3480 
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCnTATATT CCTTTACTTG CAGAATTTAT 3540 
AAAAGCCCCC AAACTGCATA TAATATG AGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 
GATOGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTG AAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAG ACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTG ATCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGG AAT GTAATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TAATTTATCT TAGGATA1TC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
GGGGACTCAC ATACATATAT TAATACCTCTGACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATGACCAT ACTGTGG AGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
G AGGAAGAAA G AACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTQT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCmT CTTAACTTAT 4380 
ACCA AATTAA CCA ACTATAT TATAGGAAAT ATGTG AA ATT AOTTCATTAG CnTATTCAC 4440 
TATTATGCATTCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCITCAAG AT TG AGGTAGAG 4560 
AATAAGAGCA AATCATTCTG GAAQTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TA AACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TTTGTATATT 4740 
GTTAAAATTG TAATTCTAA A TTGTATITCA AAAATGATTA TTTCTGATAT TGTmTATG 4800 
TCACCCATGA TGAAAACTGG ACITTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TAAAATTTTT AATAG'l ITVi TTCTmTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCnTTCTCC AATG AACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT G AAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAO AA G AAAGCA ATG 5 100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GOCTTOTGGG GCTTAGAGAC 5160 
ATTGTG AAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAG AAGGTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
GTAACCACTG CTGCTACTGT TTrTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC GAG 

Seq ID NO: 72 Protein sequence:
Protein Accession #: BAA86552
MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEVV LTIEQGNLGL 60
RINGELITAY PQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVKNT RGHRGKAVFL ARDKHHLADL 180 
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
LSEQGKQLAIQVSNILGMDVCGIDLLMKDDGSFCVCEANANVGFIAFDKACNLDVAGIIA 300 
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTiCLPGGLFNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 

Nucleic Acid Accession #: XM_040080

Coding sequence: 159-1104 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CTG AGTGGGG GCGGGGACTQ CTGOAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCQ GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120 
GTGAGGGAGT GACAGCAGCG CATTCGCGGG ACGAGAGCGATGAGTGAGAA CGCCGCACCA 180 
GGTCTQATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 
TTCG ACAG AC TCATCCCTCT TCTCCCGCAA G ACTTTTATT ACGTTGCCAT GG ATTTCGGA 360 
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GA Cl l 1 IG TG 420 
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGG AATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT ITCTGTACCT TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGA ATCAG ATGAAATGGA GAACTTGCTG 600 
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
CACGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTG A AGAGCAATAG CCACTTGAGT 720 
GAGGAGTGCG GGG AGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTG AACAG AG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 
GGATATTTTG ATTCAAGACA GAATTACTCT G AGAAGGAGT CCCTGTCGTT CATGATAGAC 960 
ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCG AACCCCA GCACGTGGCC AGTATCATCA GCrCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1 140 
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGGAC 1200 
AGGCCrCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCITTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAO GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

Seq ID NO: 74 Protein sequence: 
Protein Accession #: XP_040080.1

1 11 21 31 41 51
| | | | | |

MSENAAPGLI SELKLAVPWG HIAAKAWGSL QGPPVLCLHG WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRWAALKWN RFSILGHSFG GVVGGMFFCT 120 
FPEM VDKLIL LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 1 80 
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDHSRELCA HSIRKLQAHV 240 
LLIKAVHGYF DSRQNYSEBCE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASII 300 
SSFLQCTHMLPAQL 

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM_005794 

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGTTCCCrrC CACGCTGTGA AGCTTTGTTC nTTGGTCTT CATG ATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGQTCTGCAA 120 
CnrCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GG AATGAACA ACTCTGG ACA 1 80 
CACCATCTTT AAGAACOGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTG AGACTA TTGCCAAGTG GTGAGACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTQGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGG ATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GOTCCACCAG TGGG ATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTG AGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTG AACGT G AAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAG AACA 00 AGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1 020 
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTO ACTTCA GCAAAGTGTT TCATGGGAAT OAGTCTCTCT GQAAGAACTT 1 140 
CAAGOA ACAT CATCACSCTGC AGAGGATTOO 00 AGTCAO AG G ACTGTGCAG QAATCGTGTC 1200 
CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260
CrCCACTCGG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTOAQGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAAnrCG GGGCTTACTC ATGCTAGGCT TG AGGAAOAA GAAAAACGCT TCGOCATTCT 1440 

CC

Seq ID NO: 76 Protein sequence:
Protein Accession #: NP_005785

1 11 21 31 41 51
| | | | | |

MLSAVARG YQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHWISSRKQ QNVDRAMAKL QGEGLSVAG! VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AG VNPLVGST LGTSEQIWDK ILS VNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPVV 1 80 
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCSPDASYVNGEN lAVAGYSTRL 



Seq ID NO: 77 DNA sequence

Nucleic Acid Accession #: NM_002425

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

AAAGAAGGTA AGGGCAGTQA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGITTAGAAG 180 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 
GGTGACAGGG AAGCTAGACA CTG ACACTCT GGAGGTGATG CX}CAAGCCCA GGTGTGGAGT 300 
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT G ACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GG ACACAGTT TGG CTCATGC CTACCCACCT GGACCTGGGC TTTATGG AG A 600 
TATTCACnT GATGATGATG A AAAATGGAC AQAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTrCCGCC TTTCGCAAG A 780 
TG ATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGG ATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TG AATTTCAT TTG ATTTCTG CATnTGGCC 1020 
CTCrCITCCA TCATATTTGG ATGCTGCATA TG AAGTTAAC AGCAGGGACA CCGirillAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1 140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAOGTT GATGCTGTAT TACAGGCATT TGOATTTTTC TAC l l U l i CA GTGGATCATC 1380 
ACACnTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGnTTTAAT AAATCTAATA 1500 
ATTATrCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTG ACT 1 560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATG AA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCrriT GAATTGCACT G AACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAG ATGTGT TATTACTTCC TCAATAAAAA GTnTATTTT GGGCCTGTTC 1 740 
CTT 

Seq ID NO: 78 Protein sequence: 
Protein Accession #: NP_002416

1 11 21 31 41 51
| | | | | |

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 
KIQGMQKFLG LEVTG KLDTD TLEVMRKPRC G VPDVGHFSS FPGMPKWRJCT HLTYRIVNYT 1 20 
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 
FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 
RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRUADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHIUCSN SWLHC 

Seq ID NO: 79 DNA sequence

Nucleic Acid Accession #: NM_000493.1

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATGCTGCCAC AAATACCCTT TTTOCTGCTA GTATCCTTG A ACTTGGTTCA TGGAGTOTTT 60 
TACGCTG AAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
CAGTTCTTCA TTCCXTTACAC CATAAAG AGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT 180
ACTCCTGGTC CACCAGGCCC TGCTGOACCT CG AGGGCACC CAGGTCCTTC TGGACCACCA 240 
GGAAAACCAG GCTACGGAAG TCXTTOGACTC CAAGGAG AGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
GGACCATATG G ACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TQCCAGQ AAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACX:CAGG GGCTTTCCTG GAG AAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TG AATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTGG TCGTCCAGGT 600 
G AG AGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGGA 660 
AAA AO AOGTG AAAATGGGGT TCCAOGACAG CCAGGCATCA AAGGTGATAG AGGmTCOG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACX3AGGGCCA 780 
GAAGGCATTG G AAAGCCAGG AGCTGCTGG A GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CnTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 
GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGG AATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGOCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1140 
TCCCCTGGGT CAG ATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCOA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACTAGG TCCAAAAGGT GATCCTGGAG TTGOAGGAC'C TCXTrGGTCTC 1260 
CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCTCTA TTGGGCCACX: AGGCATTCCA 1380 
GQATTCCCTG OGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAQG GCCTCAATOG ACCCACCGGG CCACX:AGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 
ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC mrCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA nTIXiTATAA CAG GCAAC AG 1740 
CATTATGACC CAAOGACTOG AATCTTTACT TGTCAOATAC CAGOAATATA CTATnfTTCA 1800 
TACCAOGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1860 
GTAATGTACA CCTATGATGA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTOCACTCC TCTTTCTCAG 0 ATTCCTAGT GOCTCCAATO 2040 
TGAGTACACC CCACAQAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ArGTAATTTT TAinTCTGA 2160 
AATACAGATTTGAGCTATCAGACCAACAAA CCTTCCXrCCT GAAAAGTGA G CAG CAACGTA 2220 
AAAACGTATG TG AAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CnTAAGGTT 2280 
TTCTCCAATATTAAAAAATATCACXrAAAGAAGTCCIGCTATGrrAAAAACAAACA^ 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAG A AATAOAGCTC TAAGTTATGT 2400 
GAAATTTGAT TTOAG AAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 
TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TG AGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAA AGCTC TTATATAAAA AAGCCCX:AA A ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT GAACCTTTrC AAAACTTTTC TATGATTGCA G AG AAGCTTT TTATATACCC 2760 
AGCATAACTT GG AAACAGGT ATCTGACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATnrCTGGA TTTACAOAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CT GAGGG TTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TAl U 1 l ATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 
TCCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCM i l CATTOGACAC AATAAAATAA 3180 
CATCAATAGA TTnTAGG 

Seq ID NO: 80 Protein sequence:
Protein Accession #: NP_000484.1 



1 11 21 31 41 51 
MLPQIPFLLL VSLNLVHGVF YAERYQTPTG IKGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60
TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKG APG 180 
VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGUX3PK GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGl 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGrYYFS 600 
YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 

Nucleic Acid Accession #: NM_000786

Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
| | | | | |

CGCGATTCTC AGGQATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCOGGCC CCGTACCACT 60 
QTGCCACAGG CGCAGCXXX3C TTCCTCAGGT GCCCTATCCC GCX3CAGAAGA CCACGGCTTC 120 
ACAQAGTOTT ATTTAAGQOC OTOGCCAQCG GAACATCCOO CCCCATTCTG TGACOCACGG 180 
GGTGGOGCGC GTGGGACCCG AGOGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
OCCCTGTGGA CG ACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGG AGG AGCGGTGQCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG GCGGCTGGG A TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGG AG AAGG TGACAGGCGG 420 
CAACCTCTTO TCCATOCTGC TG ATCGCCTO CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTG AATT 600 
TCTAG AAAAT GCATATG AGA AGTATGG ACC TGTATTTAGT TTTACCATCG TAGGCAAG AC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATG AAGA 720 
CCTGAATGCA G AAG ATGTCT ACAGTCGCCT GACAACACCT GTGT1TGGGA AGGGAGTTGC 780 
ATACGATGTG CCTAATCCAG TTTTCTTGG A GCAGAAGAAA ATG1TAAAAA GTGGCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTG A 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGG AGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080 
GCTGCCTTTG CCTAGTITCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGOATATTTT 1 140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTnTTG GCCAGAG ACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 
CrGTGQAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GG ACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTG A 1620 
TOGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCXH'ATGTGC CATTTGGAGC 1680 
TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAGA CAATTTGGTC 1740 
CACTATGCTT CWTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1800 
TACAACTATG ATTCACACCC CTG AGAACCC AGTTATCCGT TACAAACGAA G ATCAAAATG 1 860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATG AAG TGTACAAAAC AACTCTTGTA GTTTACTGTT 11 1 l lA AGTG TGTAATTCTA 1 980 
AAAGCCAGTT TATG ATTTAG G ATTTTGTTA ACTGAATGGT TCTATCAAAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAG AGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGQ 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGG ATG GAAGGGTAGG GAATAATCG A 2340 
ACATCTAAAG CCTTGAATAA G AGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTATAGGAAAAAGA AATCCTTTGG AAITTTAACTAAAATCACTG CATATGGGAA 2460 
ATTAAGAGATCCAGGACX:AT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TG ATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCnrCC 11 1 11 1 1 1 1111 11 1 i U 1 1 2700 
TTGAGATGGA GTCTCX3CTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGG A 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTATTTTTAGTAGA GACGGGGGTT 2880 
TCACTATGTT GGCTAGGCTG GTCTTGAACT CCTG AGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 

rmcrmT aaaatoto ag atcagtg aac TCTGGTmrA agataatctg aaacaaggtc 3 1 20 

CrrGGOAGTAATAAAATTGGTCACATTCTGTAAAGCACATTCTGnTAGG AATCAACTTA 3180 
TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT GAAAAAATAC CACATCGTCT 3240 
GmrCACTA GGTG ATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAG AC TCATOTTAAA CTCTAATTOT GAAGGCAGAA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCrC TTTG AAAAAA T 

Seq ID NO: 82 Protein sequence:
Protein Accession #: NP_000777

1 11 21 31 41 51
| | | | | |

MAAAAOMLLL OLLQAGGSVL GQAMEKVTGG NLLSMLUAC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPYI FSPIPFLGHA lAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 120 
AaFNSKNED LNAEDVYSRL TTP VFGKGVA YDVPNPVFLE QKKMLKSGLN lAHFKQHVSI 1 80 
lEKBIKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREDCDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300 
PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCIKETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQDCTIWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP VIRYKRRSK 

Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #: NM_006551.2

Coding sequence: 64-336 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51
| | | | | |

AATTCTAGAA GTCCAAATCA CTCATTGnT GTGAAAGCTG AGCTCACAGC AAAACAAGCX 60 
ACCATQAAGC TGTCGGTGTG TCTCXTTGCTG GTCACGCTGG CCXTTCTGCTQ CTACCAGGCC 120 
AATGCCG AGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180 
CCTCTGTTCA AGTTA AGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACG A AGCCT CATTGCGG AA 300 
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACTIT CATCCTGGTT 360 
TCCACTGTCT TTCAATGACA CCCTGATCTT CACTaCAGAA TOTAAAGOTT TCAAOGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO: 84 Protein sequence: 
Protein Accession #: NP_006542.1 

1 11 21 31 41 51 
| | | | | |

MKLSVCLLL V TLALCCYQAN AEFCPALVSE LLDFFFISEP LFICLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVKILKKCS V 

Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: NM_002317.1

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGGCCAGGAC TG AGAAAGGG GAAAGQGAAG GGTGCCACOT CCGAGCAGCC GCCTTOACTQ 60 
GGGAAGGGTC TG AATCCCAC CCTTGGCATT GCTTGGTGGA GACTG AOATA CCCGTOCTCC 120 
GCTCX3CCTCC TTGGTTG AAG ATTTCTCCTT CCCTCACGTG ATTTG AGCCC CGm i lATT 1 80 
TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATGCGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCXTCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCXXSCGCG AGCCGCCGGC GGCTCCGQGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCX3CCGCCGG QACXiCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
CCCAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGGAGA 660 
ACCAOACAGC GCCGGGAGAA GTTCCTGCTC TCAOTAACCT GCOGCCOCCC AOCOGCGTOG 720 
ACGGCATGGT GGGCGACGAC CXTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 
ACAACTACTA CG ATACTTAT G AAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTG AG ATGCGCGGCG G AGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CX5ACCAAGAT 1080 
ATTCCTGGG A ATGGCACAGT TQTCATCAAC ATTACCACAG TATGGATGAG TTTAGCCACT 1 140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTG CATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTG ATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA OOTCAGTGTA AACTCCAGCT 1380 
ACCTGGTTCC TGAATCTGAC TATAOCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
G ACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TTAGAAGGCA AAGCAAAACT 1500 
CCCAATGG AT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAG ACT AACTTCAGTA 1 560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATrrT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
TTTTAATGTT TATTATTTAC ATCACrTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTG A CTCTTTC AAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Protein sequence:
Protein Accession #: NP_002308.1

1 11 21 31 41 51
| | | | | |

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPGAAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 120 
ARHWFQAG YS TSRAREAGPS RAENQTAPG E VPALSNLRPP SR VDGM VGDD PYNPYKYSDD 1 80 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNWRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: NM_006419.1

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51
| | | | | |

TTCGGCACTT GGGAGAAG AT GTITGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
GCTCAAGTCr GAACTCTACC TCCAG ACAG A ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 180 
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAG ACGOT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGQ GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTG AGAAAAA GA AGTTCTTC AACTCTACCA G1TCCAGTGT TTAAG AGAAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCXTTGCTCT GGATTTTAGT 480 
TTrGTGCITA GTTAAATCTT TTCCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGG AG C TG ATGGGGCA AACTCAAGCT TCTTC ACTCA 600 
CAGCACCCTA TATACACTTG G AGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCnTG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGGA AT GAG AATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAG ACTCAA 840 
AAAGCTGCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
TAGATATCCT CTGCTTAAAA ACTCACTACG G AGGAGAATT AAGTCCTACT TTTAAAOAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTG ACTTTTT TTGTGGGGGC GGGGCCGGGG 1 1 40 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATTTTACrr 

Seq ID NO: 88 Protein sequence:
Protein Accession #: NP_006410.1

1 11 21 31 41 51
| | | | | |

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 
PRKmiVWKK NKSIVC VDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

Seq ID NO: 89 DNA sequence 

Nucleic Acid Accession #: NM_002652

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CTTCTCTGGG ACACATTGCC TTCrGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120 
CAGGACAACA CTCGG AAGAT CATAATA AAG AATTTTG ACA TTCCCAAGTC AGTACGTCCA 1 80 
AATGACG AAG TCACTGCAGT GCTTGCAGTT CAAACAG AAT TGAAAGAATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTQ ACGACAATCC AAAAACCTTC TACTGGGACT TTTACAOCAA CAG AACTGTG 360 
CAAATTGCAG CCGTCGTTG A TGTTATTCGG GAATTAGGCA TCTGCCCTGA TOATGCTGCT 420 
GTAATCCCCATCAAAAACAACCGGTTTTATACTATTGAAATCCTAAAOGTAGAAIM 480 
AAQCOCTOTCTOTTTGCCACACCCAGGTaATTTCCTCTAAAOAAAC^ 540 
TGCTGTGOTC TATAAAATAA ACTTCTTAAC ATGCTT 

Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP_002643.1

1 11 21 31 41 51
| | | | | |

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLAVQT 60
ELKECMVVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAVVDVIREL 120
GICPDDAAVI PIKNNRFYTI EILKV

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession #: AK000341

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GATAGCGCCG GGCAGAGOGA CCCGGCTACC CnrGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATCATGGAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCnmTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGOTO OTTCACGTTG 180
GACTCTTACC TTCCTACCTT n i 1 C I 1 ACT GTC ATGTATC TGCTCTCAAT ATGGCTOGGT 240 
AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGGAATCA CACITCTCrC CGCGTAC ATG CTGGCAGAGC TCATTCTCTC CACTTGGG AA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATCOGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 
TrCGTITTGC GGAAAAAAAC G AGTCAO ATT ACnTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTITAACA TCTGGTGGTG TGTCTTG A AC TGGATACCTT OTGG ACAAAG TTTCTTTGG A 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGGACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGG AAG AAATATCTCA CACAGGCTCA GCTQGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 
GOTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CITAAAmT 840 
TATOTTCAG A CATACCG AAA AAAGCCAATG AAGAAAGATA TOCAAGAGCC ACCTQCAGGG 900 
A AAGAAGTGA AGAATGGTIT TTCCAAACCC TACTTCACTG CAGCAAATGG AGTG ATGAAC 960 
AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020
TTGrmrAAA GCAAAGACTG AATTGAAGGTTACATGTm AGGATAAACT AATTTCrnr 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1140
AACTOAATGC TTTGATCAGC ATTQAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTQ 1200 
CAGCTTCTCr CTCCXTTCCCT CCCACAOACT GAACCITTCG CCAOAAGCTG TCCTTATAAC 1260 
GCCITATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAG A GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAAITA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CXriTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAO ATAQAAAAAT 1500
CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAOATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1740
GTTATACAGA TGCX:ATGCTC CACACCACGA GCAGTGTACA AATCrCGCTG CCCGTTTACT 1800 
TTCTGAGCAA GCACTGGAGT CCACTCCOAC CnTrrCTTT GAACATGCAT aCTOCTGGAA 1860
TATGTATAAA TCAOAACTAG CAGAAGTAGC AGAOTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTG A GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 
CAGCACrCCT GTrCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAOT 2160 
AGACCAAAGT ATTAATG AGT ATTTCCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACX; AAGAATOGAA ACCAAGAATA TTTTCTCAAG ATTTTTTTGA AG ATTAATTT 2280 
ATACCCTATA AAATAAAACT TOTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 92 Protein sequence: 
Protein Accession #: BAA91096.1 

1 11 21 31 41 51
| | | | | |

MEHLKAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLSIW LGNKYMKNRP 60 
ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 120 
FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAVVKPCGF PFGCLIFQSS 240
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 

Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_000044 

Coding sequence: 1115-3874 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA OCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGCC TCTTGCAGCG 180
CGGCGGCTTC GAAGCCGCCG CCCGG AGCTG CCCTTTCCTC TTCGGTGAAG TTTTTAAAAG 240 
CTGCTAAAGA CTCGG AGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCOCGAQT TTGCAGAGAG GTAACTCCCT 480 
TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCG AGTGCGG 660 
AGCAGAGATC AAAAGATG AA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAAAAQCCGAAATAAAAGAAAAAGATAATAACTCAGTTCrrATTTGCACCTAC 780 
TTCAGTGGAC ACTGAATTTG G AAGGTGGAG GATTTTGTTT TTTTCnTTA AGATCTGGGC 840 
ATCnTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACG AG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAO CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 
GGGAAGTAGG TGGAAGATTC AGCCAAGCTC AAGGATGGAA GTGCAGTTAG GGCTGGGAAG 1140
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 
GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TGAGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
AOCTCCGGAC GAGG ATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 
GCAACTCCrr CAGCAACAGC AGCAGGAAGC AGTATOCGAA GGCAGCAGCA GCGGGAGAGC 1740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTGGGTGT 1860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 
AGGTTCTCTG CTAGACG ACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TCCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
AGCACTGGAC GAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220
GGCCGGACXG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCGAGCCTG CATGGOGCGG GTGCAGCGGG ACCCGGTTCTGGGTCACCCTCAGCCGCCGC 2400 
TTCCTCATCX: TGGCACACTC TCTTCACAGC CGAAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGCGGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
CAGAGTGCX:C TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GGAGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTrCCACXXrC AGAAGACCTG CCTGATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTCTTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAA A TG ATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAA ATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120
TCAGCCCATCTTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CGACAACAAC CAGOCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACAOGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGACCAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG G ATGTACAGC CAGTGTGTCC G AATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCXTAGTCGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGG AACTCG A TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCXiAGAGAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCXX3 GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTCCCCACC 3900 
CCAGCTCATG CCCXXmTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGQ GGAATTTCXr r CT ATTG ATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTmTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTOCCTATCT 4080 
AACOCTCCCA TGGCACCITC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTnTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACG1TTA CTTATCTTATGCCACGGGAA 4260 
GTTTAGAGAG CTAAGATTAT CTGGGG AAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Seq ID NO: 94 Protein sequence:
Protein Accession #: NP_000035.1

1 11 21 31 41 51
| | | | | |

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQQPSQPQSA 120
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 180
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360
DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660
VSHIEGYECQ PIFLNVLEAI EPGVVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYIKELDR 840
IIACKRKNPT SCSRRFYQLT KLLDSVQPIA RELHQFTFDL LIKSHMVSVD FPEMMAEIIS 900
VQVPKILSGK VKPIYFHTQ

Seq ID NO: 95 DNA sequence

Nucleic Acid Accession #: NM_002497

Coding sequence: 135-1472 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120
GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180
CAGGCTCTTA CGGCrGCTGC CAG AAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGITT 240 
GG AAAGAACT TGACTATGGC TCCATG ACAG AAGCTGAGAA ACAGATGCTT GTTTCTG AAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATG AT CGGATTATTO 360 
ACCGGACCAA TACAACACTG TACATTGTA A TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GITCTTCG AG 480 
TG ATGACTCA GTTGACTCTG GCCCTGA AGG A ATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATCTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTrGGGCTA GCTAG AATAT TAAACCATGA CACXJ AGTm GCAAAAACAT 660 
TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTG AT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
ATTACCATCG ACCTTCTGTT GAAGAAATTC rrGAGAACCC TTTAATAGCA GATTTGGTTG 960

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CX:AGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTG AGTG AGC TG AAACTG AA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 

GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260
CATCCTCAGTAATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTOAG AA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAA A AATTACCAAC 1440 
TGAAAAGCAG ACAGATCXTTG GGCATGCGC T AGC CAGGTAG AGAGACACAG AGCTGTGTAC 1500 

AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTITTACrG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CAITTAATTT TTCITTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

TATGCAGGAA OAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCmT 2040 

TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100
TAAAAGAATT CrrCAGTTA

Seq ID NO: 96 Protein sequence: 
Protein Accession #: NP_002488

1 11 21 31 41 51
| | | | | |

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60
DYEVLYTIGT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 180
CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KIREGKFRRI PYRYSDELNE 300
IITRMLNLKD YHRPSVEEIL ENPLIADLVA DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCKDLKKR LHAAQLRAQA 480
LSDIEKNYQL KSRQILGMR

Seq ID NO: 97 DNA sequence 
Nucleic Acid Accession #: NM_007050.2 
Coding sequence: 185-4576 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGG AGCGCCG GCTCCGGGAG 60 
CCCGAGCGGA GCCAGCCGCG CGCACAGCCA GCGGOCGCGC CGGCGATGCG GGGCCACCCC 120 

GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 180
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCrCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 

AACGGAGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420
CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTOA AGGAGAATGA 480
CACCCACTGC ATCGACTTCC ATTACTACIT CTCCAGCCGT G ACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TGAATGGTGG CCTCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 

TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720
GGACGAGGTC CGGGTCCTTG CTCATCCATG CAGAAAAGCA CCTCATTTTC TGCGACTCCA 780
AAACGTGGAG GTGAATGTGG GGCAG AATGC CACATTTCAG TGCATTGCTG GTGGG AAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCA ACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTGAT GGTGGGTCTG GTGTGTCCAA 1020 

CTACGCGGAG CTGATCGTGA AAGAGCCTCX: CACGCCCATT GCTCCX:CX:aG AGCTGCTGGC 1080
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TCCATCATCG GGGATGGCCC 1140
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCGACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCAOCAC 1320 

CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGOAAATCG TAGACATCAG 1380
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 

GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740
CGAGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800
GGGGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1860
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCnTGGGC CCCCTGTCAC 1920
CACrCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACG ACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040
TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CX:AGCCrCGA 2160
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAG A CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCXXTTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GG AGAGACCA AAATCAACTG 2340 

TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 

GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700
TGAAGGCTTC TCTTCTAGTT CTCAGGACGT CAACGG ATTC ACAGATGGCA GCCGCGGGGA 2760 
GCmCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGG A 2820 
GATGAGCTAC CCCCGGGACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAG AGAG GCCAGGGCTA CGGGTTCAAG G AGGAATACG AGGCCTTACC 2940 

AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACXX3CA ATAAGAATCG 3000
ATATGGGAAC ATCATATCCT ACG ACCATTC CCGGGTG AGG CTGCTGGTGC TGGATGGAG A 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3120 
CATTGCGACT CAAGGTCCGA TGCAGGAGAC TGTAAAGGAC TTTTGGAGAA TGATCTGGCA 3180
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 

TGTGCGATAC TGGCCAGATG ACACGGAGGT CTACGOAGAC ATTAAAGTCA CCCrGATTOA 3300
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTG ACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCA AG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 

TGACACCATG CTTGACATGG CCGAGAATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAG ACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 

GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900
CCTGCCCTTC CTTATCTCAG TGG ACGG AG A ATCC AGC AAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 

GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCGACGAG GACATCATCC ACAGAATATT 4200
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 

GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500
ATCCAACATG GTGGAG ACCC TGG AACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC mTAGCTCA ATGGGATGGO GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620
CAAGCCCCCTTTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 

GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCrcCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAA A AGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 

CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCrrCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTG ACC GTCCAGCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATAA A GGTGTTCTTr GCTTTTTTGT 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 

AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTGTGTT 5400
TCTTTGGTGG TAGCTTTCrr TGGAAGAGAC AAACAAGATA AGATTTG ATT AnrrCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCr TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCITI lATT 5580 
TTTTCCTGGCAATGTCCTrAAAAGCTCCCACCCTGGGACAGCATGCCACTGAGCAAGGAG 5640 

AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 

AAGTCTCTAC TGTATGCCAG GGATCATGCT TCGCACTTTC CATAGGACAT TTCACACAGT 5940
CCTTAGAACC CCCAGGAGAG AGCTACTGAC TTOTTATCAT CTOCATTTGA TCATCTCCTC 6000
CAATGAGGAA ACCCACGCAC CTTCC TTAGT AATGA AATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTr CTTCATCTTC TCTAAGCCAA GCAATrATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGAAATTTAG 6180
TACTTTCrrC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 

CACATTCCCA GCTCCOCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300
TCCCTGTTCC TTCAGGCTAT G AATGG ACCA GCCAGGGAAA GTCTCGACCT TGATCICTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 

TTGAAGGCTG GTCTAGCCCC AG ACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 
TrrGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTGTGGA 6540 
GTGGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG OGCTTCXTQG 6600
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGG AAACT 6660 
CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATTGGGAGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATG AG 6780 
CCAGGGTCCA GGAGACAG AT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTOCATGGA GCTTTAGAGA GATGCACTCA GCTTCGTGTT GGCCAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 
CmTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 
GCAATGGATC ATGAGACTCA GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGG AG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATG AAAGG AGCCXjACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGA AGTG CCTCAAAGCA AAGGCAAGCA 7260 
AAGGAACOCCACGATGAACTTGCTCnTrTCCnTraATGAGCCICrOCCCAGGTGTAm 7320 
AGCAGACCX:C GGGG ACX^CAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCOCAGCTG GCCTTCCOCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATC CTAAG ATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 
AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAG AAAC 7620 
AAACTTGCAA AAAAA AAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTG AT 7680 
GATTTCXXCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 
GTGAATAGGTCTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATGAC 7800 
TTCOCTAAGA TCCCATGGGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 
CXXIAGGAATTTGAACATriTCCTrTACCCAGGAAATCTCCGGAGCCAACACC^^ 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TGAGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TG AGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTGT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160
GGGTGAAGA A TCTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
TCnTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGG AGG GTTTGTGATT GTCACCATTT ACTGATGAGG AAACTA AGGC 8400 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 
TAAACCGGTT TTTTCTCACT ACAGAGACAA TATTTTrCCA CCATTGTATC TCACATmT 8520 
OCCAGGAGGT TACCCATAAC AG AAGAG ACT AGAGTGGAAC AGATACGTCA GTGGATA A AG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAAATTTGCATTCCACTTTCTGATTTAGCCTrGTTGGTm 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 
TATGACGG AC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
TGACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA G AAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 
TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CXTTAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATGGCAC AATTAATTCC 9300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 
ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGGAAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGCAGATG A CTTATTTAGC AAAATGGTCA CTTCTGTG AC ATAGCTGG AG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 
CCACACTAGC TCTTGCTGTA AGTCCTCACC AGG ATCTACA TTTCCTCGTC GCTGGTGGGA 9720 
ACCOCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT GCAGGAGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTGAAG G AAATG ATAA TTCCTTGGTA CCTCACTGCC TTGGG ACAA A 9900 
ATGAAGAAAG CCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 
CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 
AAATCCAACT AGAAGAGACA AGCTGAGATT CAG AnTGTT TACTCCTCCC ATGCAAAGTT 10320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCGAAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
ACrCTTCTAG TTTTTTAGTA ACCAGGGATG GGCmTGCA TGGCACTOAC TATAGAG ATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCA CTGG GGC AGATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
CTATTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
GCrCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAOAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 11040
TTAATGTAGT TCATGGTTCC TTAGTGAGCA GGACTCTTGG ATGTGGAGGA GAAAGGGTCA 11100
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 11160
GGGCTTATGA ATACAAATCA GTOGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 11220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTOTTCTT TGGGTCAAAG 11280
GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAOTGGTGT CATTCCATGT AAAGTTAGGC 11340
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 11400
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 11460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 11520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 11580 
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 11640 
TTGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAGA TTGTGTTrGG TCTGTACAGG 11700
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 11760
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 11820
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 11880
CCCACTGACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGA AGGTCCA AGTAGGACAC 11940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TG ACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CC ACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGnTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAA ATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT llTil il ll C 12480 
mTCAATTCTAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

Seq ID NO: 98 Protein sequence:
Protein Accession #: NP_008981.1

1 11 21 31 41 51
| | | | | |

GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 180
VKVNGGPQGN PVWNVSGVVT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC IAGGKWSQHD KLWLQQWNGR DTALMVTRVV 300
NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLAVGAT 360
YLWIKPNANS IIGDGPIILK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480
VQYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELVVQTEED 540
VPGAVPLESI QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600
KLRNETHHLF VGLYPGTTYS FTIKASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660
DTTITVMLKP AQSRGAPVSV YQLVVKEERL QKSRRAADII ECFSVPVSYR NASSLDSLHY 720
FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAGVIAGL LMFIIILLGV 840
MLTIKRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900
SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLLQHITQM 960
KRGQGYGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1140
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEGVVD IFNCVRELRA 1200
QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1260
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320
QPAAFVVTQH PLPNTVADFW RLVFDYNCSS VVMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380
EFVSADIDED IIHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKVVRRLE 1440
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NIIDVFHIVK TLRNNKSNMV 1500
ETLEQYKFVY EVALEYLSSF

Seq ID NO: 99 DNA sequence

Nucleic Acid Accession #: NM_002988.1

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCXTTG 180
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTG ACCCCA ATAAGAAGTG 300 
GGTCCAG AAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTOAA CTTGGTGGGC CX:AGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 
CCACCCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT G TTGAG CTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATG ACATTQ TCTTTATCAT CCTTTCCCXrr 600 
TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTnT GCTCAGCACC 
ACCrnTAAT ATATTGGCAG TACTTATTAT ATA AA AGGTA AACCAGCATT CTCACTGTGA 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence:
Protein Accession #: NP_002979.1

1 11 21 31 41 51

| | | | | |

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 101 DNA sequence

Nucleic Acid Accession #: NM_015507.2

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCOCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGGAGC GGAGGACCCXi AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 
ATG CCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATG.CGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AG ATGCTTTC CAGGATACAC CGGG AAAACC TGCAGTCAAG ATGTG AATGA GTGTGG AATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACX} GAAGCTACAA GTGCTTTTGC 600 
CTCAGTGGCX; ACATGCTCAT GCX^\GATGCT ACGTGTGTG A ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GA AGAAGGGC CACAGTGCCT GTOTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCXiA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
AATGAATGTA CTATGG ATAG CCATACOTGC AGCCACCATG CCAATTGCrT CAATACCXIA A 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTQTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCT TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCG AG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCA AA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCr TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
TGG A ATCCTG CTGATCG AGA TA ATGCTATT GGCTTCTATA TGGCAGTTCX: GGCCTTGGCA 1 560 
GGTCACAAGA AAG ACAT TGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 
TTTGTGAA AA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAO 1740 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGA CT G AA TGTTACT ATCTTTATAT 1 920 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAG ATGCCTT 2040 
TCTTGTATA A GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2 1 00 
TTTCTGAATC TTTCCACATT AT ATT AT AAA ATATGG AAAT GTCAGTTTAT CTCCCCTCCT 2 1 60 
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAG AAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATG ACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCmCA TAGCCAAACT 2340 
TGTATATTTA A ATTCTTTGT AATA ATAATA TCCAA ATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 102 Protein sequence:
Protein Accession #: NP_056322.2

1 11 21 31 41 51

| | | | | |

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLSVDD 

Seq ID NO: 103 DNA sequence

Nucleic Acid Accession #: NM_001565.1

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 
ATTCA AGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 1 80 
CCTGTTAATC CAAGGTCTTT AG AAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
CGTGTTOAGA TCATTGCTAC AATOAAAAAG AAGGGTGAGA AGAGATQTCT GAATOAG AA 300 
TCGAAGGCXA TCA AGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCX5ATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CXTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 
TCAGAATCrC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACrrCATGGA CTTCCACTGC CATCCTCCCA AGGGGOCCAA ATTCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAG AT GTATATATTT CCTATATIGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAG AT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATG AGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO: 104 Protein sequence:
Protein Accession #: NP_001556.1

1 11 21 31 41 51

| | | | | |

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP

Seq ID NO: 105 DNA sequence

Nucleic Acid Accession #: NM_015068.1

Coding sequence: 1170-2243 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGGACGA GCTCTCTGAA GAGATCAACA ACITAAGAGA GAAGGTCATG 180 
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240 
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCTGAGGATG AGGATG ATGA CATCGAGCTC 300 
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TG AAGACCCT 600 
CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAGATGATT GCCCAGGACC TGGATTGGAA CGAGCCTGCG 720 
CTG ATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTG A G AG AAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CA AGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTG AC AATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1 140 
ATG ATGCCTC ATCTCCACAC TTGCAAGTG A TGC TCCAGAT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA A AATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGG AGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TGATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCAC ACAGT A 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TGGCAAGAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1 860 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTIICITA TGATTGCCGA GCTCCAAACA 1920 
AtrrTACTATCCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAACGTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTACCCAGTAGGATTCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAGATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGCCGCCGCCTCC ACCACCACCA CCGCCGOCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
TTTATTCXTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340
TCAGGCCACC TGAGGCACAT CCTCTCTG AA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCXTTAAAGCA CCAAA AGACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTAACTAA AATAATTAAT 2520 
AAGTITACTT TCCAGCCAGT CCTGGAAGTC TGGGnTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGCCA AATTTGCTGT TTAACATTTA CAGAGAAGCT G ATACAAACG 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG GAGGAGGACA TGACmTCT 2700 
TGCGGTTTCG GTAOCCTCTT TTTAA ATCAC TGG AGG ACTG AGGCCTTATT AAGG AAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTG ATCCTCT CGCTGCACCC 1TAGAAACTT 2820 
CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 
GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTCTTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAGCATTAAACCTGGTAACCAGAGATTACTGGTGGCTCCAGCGTTGTTAGATG 3120 
TTC ATGAAAT GTGACCAOCT CTC AATCACC TTTG AGGGCT AAAG AGTAGC ACATCAAA AG 3 1 80 
GACrcCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGT TCITAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGGAGAGAG GAGAA ATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 
GAAGGGGG AC ATCTTTG AAG AAG AGG AATT AGGGCTGTGG ACACTGTCnT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAGAATOCAC 3540 
GTA ATGA AAA AGGTCCCTCT AGAGGATGGA GCTGATGTGA AGCTGCCAAT GGATGAAAAG 3600 
CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660
AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGG ACTGCC TGACTTCCTT 3720 
TAGCCTGGTC CCTTGCTACT ACCTTG AACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 
TTCnTGCTA CTGCCATTGA CCCTGCTGCA GGATTTGTGT CATTTTCCTG CCTGGTTGCT 3840 
GAGACTCCAT TTTGCTGCXIA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 
CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAACGGTG AAGTITCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATOCGC 4020 
TCTTATTCTT GCATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTnTGAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCACTATCCA GTTCTTATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TGAATGAGAG GCTCCTCAGA ACTGG AGCAT TTGTATAATA ATTCATCCTG 4260 
TTCATCTTCA ATTrTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 
CATATAAAAG GATATAAAAT TTGA AAAGAG AAACCTAATT GGCTATTTAA TCTAAAACAA 4380 
Cl i l l l lii l TCCTTCAATG GAATCAG AAA GCTTGTCAAT CACTCATGTG mTAGAGTA 4440 
ATTA Cri 1 - 1 A AAATGGTGCA TTTGTGCTTC TGAACTATTT TGAAG AGTCA CTTCTGTTTA 4500 
CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 
AGTTGTCAAT TGATCTTCAA GCTGCAGGGT GCCTAGAAAT GGGCCGTTGT CTGTAGCCCT 4620 
GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAGAA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC 4740 
CACATTTGGC TGTTTACTAA AGCTTGTGAT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
rrGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TGAAGITTGG TTAAGTTTCC TTTC ACTCAA TTGATTTATT TTGTTGTTAA 4920 
TCAAATTTAT GTTAATTGGA TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 
TTATTCATAA G AAAGGAAAA AAATCAATGG AATTTGATAT CTAAAGAAGT TAGAAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAA AT TTACATTGTT AATTAAATCA 51 60 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGGCA AGC CTAGAGAG AA CTAGCTAAAT 5220 
TTATCATGCT AGGATATTGA AACACAGAAA GITTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTTTGTAGGTACX: TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACX:ACAG ACATCTCACA TCAGATACAG ACAGTTCCAA G ATTGACAAC AG AGAA CAAC 5520 
CTGCTGGAAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG AAC CATG CTA CATTTTCATC 5580 
TAAAGAGAGA ATGCACATCTGATGAGACTG AAAGTTCTTT GITGnTTAG ATTGTAGAAT 5640 
GGTATTGAAT TGGTCTGTGG AAAATTGCATTGCTTTTATT TCTTTGTGTA ATCAAG1TTA 5700 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
TA AACTCXjGA GGGAACTGCA GGGAG ACCAA CTTATTTAGA GCGAATTGGA CATGGATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCrCTGATAA ATTACTGCTA GAATG AACTT GTCA ATG ATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGCTTTTACCCCTGTCAGTAGCCCTCCTCCTAC 6120 
CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 6180 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAGTGC 

Seq ID NO: 106 Protein sequence:
Protein Accession #: NP_055883.1

1 11 21 31 41 51

MTERRRDELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60
LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120
VTSMMTGRAA RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 180
VIDYSNAFQM IAQDLDWNEP ALIDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCIHIERR 240
LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300
GGHYADNCPA KASKSSPAGN SPAPL

Seq ID NO: 107 DNA sequence

Nucleic Acid Accession #: NM_003679.1

Coding sequence: 47-1507 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG ACTCATCTGT 60 
CATTCAAAGG AA AAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 1 20 
CTTTCTTGCA AAGAGG AATT TCCAGATTGA TGTATATGA A GCTAGGGAAG ATACTCGAGT 1 80 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACG ACAAGC 240 
CTTG AA AGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCA AG 300 
AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTG AGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480 
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGG A GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TAG ATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AG AACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TG ATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780
AAAACTTCTA ACCAGTAATG ATGTGGTAG A TTTCTTOCAG AAATACITTC CGGATGCCAT 840 
CCCTCTAATT GGAG AG AAAC TCCTAGTGCA AG ATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGG AGATGC 960 
AGCTCATGCT ATAGTGCTGT TTTTTGGGCA AGGAATGAAT GCGGGCTITG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 
TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCrAT ACAATGGTCA CTTTTTOCAG 1260 
AAT AAGATA C CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCrnTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATITCCA ATCTCATTAG 1500 
CAGGTQATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTG ACC AA AATTAAGC 1 560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAGATAGT GTTCTGCTTA 1620 
T AATT AAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTGCA ATGAC TAAGATCCTT CACTTCTCTG A AAGTAAGGC CCTAG ATGCC 1 800 
TCAGGGAAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCA ACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACmTCAT 1920 
TACGTG AATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGA A TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA G AG ATCAGGT CTCACTCTGT TGACCAGGCT GG AGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCrCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA G AGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGG ATT AGAGTTGTGA GCCACCGCTG CCAGCOCAGA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAA ATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA 2640 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 
TAATTATGAA GATGAAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATG ATTCAGT GTTTCTnTC TATATTGTCA ATGAAAACCT TG AGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGG AAAG AAA A AATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA A ATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGA AACAACA 3 120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTOCAGG CTGACAAGGA TTCA AAGTTG 3240 
TCTCTGAAAC TCCTCTTTGT CATACTGCAG ATATAAAACG T CI lU 0111 CCAACA AGAG 3300 
GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTGAATAGAG TCCAACAGTA CA AAA AAA AT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTA AGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTnTCA TTATGTAAAG CACCCGTTGA 3600 
ATTAA AAG AA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAGAGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTTGATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAAAGTCCA A AAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGACAGTG ATTCACCACT GGTGATG ATA AAAATGATGG AAGAAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCmCTGTGACATCACAA TGGGTCTGAT 3960 
CTGC ATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT A AAGTGGAAT 4080 
ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140
TAAGTTGTTT nTGTTTGTT AGCAGGTGTG G ATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAAATC ACCCCAGAGC TTTATGTCTTTTATTCATTCTAAATCTTATTAACCGGAAT 4260 
ATGTAGGACC ATTTCA ATAC CTTGTA ATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
GAGGGCAGGT ACAACTATTA AGAG ATTTTG AACATTAAGT TAGTCCACAA ATATTCAGTG 4380 
GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAGA CmTTACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTA ATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 
CATCGAATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 
GAACCTGGGA GGCCAACGCC TAAAGATCAT A ATATCACAA TGGAAGGAAC CTAGATTCCT 4740 
AAATGACTGC ATAGGACAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATC ATG AC CAGTA AA ACA TA AATCAAAA GTTAATGTAA TTGTTATCCC 4920 
ATTATTTAGA GCGAA ATAAA TGTTGA ATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 

Seq ID NO: 108 Protein sequence:
Protein Accession #: NP_003670.1

1 11 21 31 41 51

| | | | | |

MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120 
AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DLIVGCDGAY STVRSHLMKK 180
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIALPNMN KSFTCTLFMP 240
FEEFEKLLTS NDVVDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300
LLGDAAHAIV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRLRI PDDHAISDLS 360
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTFSRIRYHE AVQRWHWQKK 420
VINKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNWI AHFRMTTCFP AKAVDSLEQI 480
SNLISR 

Seq ID NO: 109 DNA sequence

Nucleic Acid Accession #: NM_006115.1

Coding sequence: 236-1765 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51

| | | | | |

GCrrCAGGGT ACAGCTCCCC CGCAGCCAG A AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
CGGGACACCC CACCCGCTTC CCAGGCGTG A CCTGTCAAC A GCAACTTCGC GGTGTGGTG A 1 20 
ACTCTCTG AG GAAAA ACCAT TTTG ATTATT ACTCTCAGAC GTGCGTGGCA ACA AGTGACT 1 80 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAQTCGCT TCAAAAEGGA 240 
ACGAAGGCGT TTGTGGGGTT OCATTCAGAG CCG ATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGOGCA GAGCCTGCTG AAGOATGAGG CCCTGGCCAT 360 
TGCCGCCCTG GAGTTGCTGC CCAGGG AGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGG AG AC AC AGCCAG ACCC TG A AG GCAAT GGTG CAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGGGAGTG CTGATOAAGG GACAACATCT TCACCTOGAG AOCTTCAAAG CTGTGCTTGA 540 
TGGACTTG AT GTGCTCCTTG CCCAGG AGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
TCTGTACTCA TTTCCAG AGC CAGAAGCAGC TCAGOCCATG ACAAAGAAGC GAAAAGTAGA 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTG AGAAAG TGAAGCGAAA 840 
G AAAAATGTA CTACGCCTGT GCTGTAAGAA GCTG AAG ATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 
TACCTGGAAG CTACCCACCT TGGCG AAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATITCCCCGG AG AAGGAAG A 1080 
GCAGTATATC GCCCAGTTCA CCTCTC AG TT CCTCAGTCTG CAGTGCCTGC AG G CTCTCTA 1 1 40 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAATAACTAACTG CCGGCTTTCG GAAGGGGATG TG ATGCATCT 1260 
GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT GATGAGTGTG GGATCACGG A TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAG AGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACO TGCTGTATCC 1560 
TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA QOCTTGCCTA 1620 
TCTGCATGCC AGGCTCAGGG AGTTGCTQTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCA AC CCCTG TCCTC ACTGTGGGG A CAG AACCTTC TATG ACCCGG AGCCCATCCT 1 740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 
TTOGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAOTTTCAG 1860 
ACAAATGTTC AGTGTG AGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 
GTGATCTTTG GGGAGATACA TCTTATAG AG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 



Seq ID NO: 110 Protein sequence:
Protein Accession #: NP_006106.1

1 11 21 31 41 51

| | | | | |
MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFIPVEVLVD 180
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDIKMILKMV QLDSIEDLEV 240
TCTWKLPTLA KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 



Seq ID NO: 111 DNA sequence

Nucleic Acid Accession #: NM_003815

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CGCTGC CATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCrrcCTGG CCGCrcCX^^A ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCOCCG AGG GAGCCCTTGG AGCCCCAGGT CCTTCAGG AC GATCTOCCAA TTAGCCTCAA 1 80 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTG AGG ATC AAGTTGGAGC TGGACGGTGA 240 
CAGTCATATC CTGGAGCTGC TACAG AATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAG AGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT AOCCTGGAGC AGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGGAGTTGGT 660 
G ATTGTGGCT G ATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
C ACACTGGAA GTCGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TG ACAGTGCC CAOCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTOACTTCTC AGGAGOTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGOGC CACAGCCTGG GCCTGGACCA 1080 
TG ATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 140 
GGAGGCCrcC ACAGACITCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200 
GGAGAAAGCC CTCCTGG ATG GAATGGGCAG CrGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGOCT OCnTCrQCG QAAATATGTT TOTGO AGCCG GGCOAGCAGT GTGACTGTOG 1320 
CTTCCTGGAT GACTGCGTOG ATCCCTGCTG TG ATTCTTTG ACCTGCCAGC TGAGGCCAGG 1 380 
TGCACAGTGTGCATCTGAOG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCXGGCTG 1440 
GCAGTGTCGT CCTACCAGAG GGG ATTGTG A CTTGCCTG AA TTCTGCCCAG G AGACAGCTC 1 500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCG AGCCC TGCGCTGGCG GGCAAGCTGT 1 560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1 620 
AGCOCAGCCC GCTGCGCCAC TTTGCCTCCA G ACAGCTAAT ACTCGGGGAA ATG CnilG G 1 680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1 800 
ACTCTGGG AG ACAATAGATG TGA ATGGG AC TG AGCTGAAC TGCAGCTGOG TG CACCTGG A 1 860 
CCTGGGCAGT G ATGTGGCOC AGCCCCTCCT G ACTCTGCCT GGCAC AGCCT GTGGCCCTGG 1 920 
CCTGGTGTGT ATAGACCATC G ATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACr GTGAGGAGGG 2040 
CTGGGCACCC CCTG ACTGCA CCACTCAGCT CA AAG CAACC AGCTCCCTGA CCACAGGGCT 2 100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTG ATGCTT GGTGCCGGCT ACTGGTACCG 2 1 60 
TGCCCGCCTG CACCAGCG AC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTG AAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTQ CCGACOCCCA 2340 
GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCrcCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCO CTCTACCT CT G A CCTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAG A GGCGGGCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCAOCGCC 2580 
ACGCGCTGTC AAGCAACACTCTGCGGAOCTGCCGGCGTAG TTGCAGCGGG GGCTTOGOGA 2640 
GGGGCTGGGG GTTGGACGGG ATTG AGGAAQ GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

Seq ID NO: 112 Protein sequence:
Protein Accession #: NP_003806.2

1 11 21 31 41 51
| | | | | |

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRVVSE GHTLENCCYQ 120
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPPIISRIQ DLHLPGHTCA 180
LSWRESVHTQ TPPEHPLGQR HIRRRRDVVT ETKTVELVIV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480
RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRDLLW 600
ETIDVNGTEL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK 660
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780
CPSGDLPGPG AGIPPLVVPS RPAPPPPTVS SLYL 

Seq ID NO: 113 DNA sequence

Nucleic Acid Accession #: NM_002416

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCAC TA TG AAGAAAAG TGGTGTTCTT 60 
TTCCTCrrGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGOAACCCC AGTAGTGAGA 120 
A AGGGTCGCT GTTCXn"GC AT CAGCACCAAC CAAGGG ACTA TCCACCT ACA ATCCTTGA AA 1 80 
G ACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAA A TTGAAATCAT TGCTACACTG 240
A AGAATQGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTG AAGGA ACTGATTAAA 300 
AAGTGCGAGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 
AAG AAAGTTC TG AAAGTTCG AAAATCTCAA CGTTCTCGTC AAA AOAAGAC TACATAAGAG 420 

ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480
TTCCAAAGG A GGATGGCATA TAATACAAAG GCTTATTA AT TTG ACTAGAA AATTTAAAAC 540
ATTACrCTGA AATTGTAACT AAAGTTAG AA AGTTG ATTTT AAGAATCCAA ACGTTAAGAA 600 
TTGrrAAAGGCTATGATrGTCTrTGTTCrTCTACCACCCACCAGTrGAATTrCATCATGC 660 
TTA AGGCCAT G ATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780
TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
CTACAGGCCT C ACACACAAT OTOTCTGAO A G ATTC ATGCT G ATTGTTATT GGGTATCACC 960 
ACTGGAGATC ACCAGTGTGTGGCnTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTrCTTTT TGTrCCCCTT TGCITCATTC 1080 

A AGTCAGCTC TTCTCCATCC TACCACA ATG C AGTGCCTTT CTTCTCTCC A GTGCACCTOT 1140
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 
AAATAAACCT TTTrGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 

AG ATTOTCAG CTCCTTGAGG GCAAG AGCCA CAGTATATTT CCCTGITTCr TCCACAGTGC 1440
CTAATAATAC TGTGGAACTA GGTnTAATA ATTnTTAAT TGATGTTGTT ATGGGCAGGA 1500 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 
G ATGCAACAT CCTTGTCTTT TTATGACAGG ATGITTGCTC AGCTTCTCCA ACAATAAGAA 1 680 

GCACGTGGTA AAACACTTGC GG ATATTCTG G ACrGTTTTT AAAAAATATA CAGTIT ACCG 1 740
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAG ATC AGCCAGTGAC CAACCnTTC 1 800 
CCAACCATAC AAAAATTCCT TTTCCCG AAG G AAAAGGGCT TTCTCAATAA OCCTCAGCTT 1 860 
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCG AA ACTCATTTTA GGCAAATATG 1920 

AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT G ATTATCAAT TACCACAOCA 1980

TCTCCCATG A AGAAAGGG AA CGGTG AAGTA CTAAGCGCTA G AGG AAGCAO CCAAOTCOGT 2040
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTTGTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 
CTTTCCCAAA TTGAATCACT GCTCACACTG CTG ATG ATTT AG AGTGCTGT CCGGTGGAG A 2220 

TCCCACOCQ A ACGTCTTATC TAATCATQ AA ACTCCCTAGT TCCTTCATQT AACTTCCCIG 2280
AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC AdTACCITG CATCTCACAG 2340

GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAA A GCAAATAATT TTTCACTTCA 2460 
AAACAGTATT GACTTGTATA CCTTGTAATT TG AAATATTT TGTrTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 
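
For readers handling these listings electronically: the "Coding sequence" coordinates appear to be 1-based and inclusive of the stop codon, so a range of 40-417 spans 417 - 40 + 1 = 378 nucleotides, i.e. 126 codons including the stop. The following is a minimal Python sketch of how such a range might be extracted and sanity-checked. The helper names (`clean_listing`, `extract_cds`, `check_cds`) and the toy sequence are hypothetical and are not taken from this document; the sketch also assumes the standard stop codons TAA, TAG and TGA.

```python
import re

# Standard stop codons (assumption: standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def clean_listing(text):
    """Drop spaces, line breaks and position counters from a listing block."""
    return re.sub(r"[^A-Za-z]", "", text).upper()

def extract_cds(listing, start, end):
    """Return the coding region for 1-based, inclusive coordinates."""
    seq = clean_listing(listing)
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

def check_cds(cds):
    """Report simple consistency checks for a candidate coding region."""
    return {
        "length_multiple_of_3": len(cds) % 3 == 0,
        "starts_with_ATG": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
    }

# Toy listing row (hypothetical): three 6-mers, a 4-mer and a trailing counter.
toy = "GGCACC ATGAAA TTTTAA CCGG 22"
cds = extract_cds(toy, 7, 18)
report = check_cds(cds)
```

A check of this kind only flags inconsistencies between the stated coordinates and the sequence text; it does not repair characters that were misread during scanning.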



Seq ID NO: 114 Protein sequence:
Protein Accession #: NP_002407 



1 11 21 31 41 51
| | | | | |

MKKSGVLFLL GIILLVLIG V QGTP WRKO R CSCISTNQGT IHLQSLKDUC QFAPSPSCEK 60 
IEnATLKNGV<3TCLNPDSADVKEUKKWEKQVSQKKKQKNGKKHQKKKVLKVRKSQRSR 120 . 
QKKTT 

Seq ID NO: 115 DNA sequence

Nucleic Acid Accession #: NM_003238.1

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51
| | | | | |

CAAOCAGGATACGTrmCTGTTGGGCArrGACTAOATrGTTTOCAAAAGTTTCGCATCA 60 
AAAACAAACAACAACAACAAAAAACCAAACAACTCTCCTTGATCTATACTTrOAOAATTG 120 
TTGATTTCrT 1 i 1 M l i ATTCTGACTmA AAAACAACTT ri ' ITn CCACTTTTITAAAA 180
AATGCACTAC TGTGTGCTGA GCGCTl ri'CT G ATCCTGCAT CTGGTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATQ CGCAAGAGGA TCGAGGCOAT 300 
CCGCGGGCAG ATCCTGAGCA AGCTOAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
CX3AGGAA0TC CCCCCGGAGG TOATITOCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGGAGGGCGG CCGCCTGCOA GCGCGAGAGG AGCGACGAAG AGTACTAOGC 480 
CAAGGAGGTT TACA AAATAG ACATGCCGCC CTTCTTCCCC TCCG AAAATG CX:ATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCG ATTTGAC GTCTCAGCAA TGGAG AAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAO 660 
AGTGCCTOAA CAACGGATTG AGCTATATCA GATTCTCAAG TCCA AAG ATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GA AAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT A AAG ACAGGA ACCTGGG ATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAG AACTAG AAGCAAGATT TGCAGGTATT G ATGGCACCT CCACATATAC 960 
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCXjGAA 1080 
GAAGCGTGCT TTGG ATGCX30 CCTATTGCTT TAG AAATQTQ CAGO ATAATT GCTGCCTACX3 1 140 
TCCACTTTAC ATTOATTTCA AGAGOOATCT AGGGTQGAAA TGGATACACX3 AACOCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 
CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320 
GGTGTCCCAA GATITAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440 
TGGCAAG ACC AAAATGACAA TGATOATGAT AATGATGATG ACGACGACAA CGATGATOCT 1500 
TGTAACAAGA AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1560 
GGTACTAGTT C AG ACACTTT GG AAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTGACAC 1620 
AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO: 116 Protein sequence:
Protein Accession #: NP_003229.1

1 11 21 31 41 51
| | | | | |

MHYCVLS AFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RG QILSKLKL TSPPEDYPEP 60 
EEVPPEVISI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFYRPYFRI V RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP 1 80 
TQRYIDSKVV KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC FYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LVYIGKTPKI BQLSNMIVKS OCCS 

Seq ID NO: 117 DNA sequence

Nucleic Acid Accession #: NM_000095.1

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

CAGCACCCAG CTCCCCGCCA CCGCCAISGT CCCCG ACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGOGCXyr COGGACAGGO CCAGAGCCCO TTGGGCTCAG ACXTGGGCTC 120 
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGOACTGGCT 1 80 
GCGGCAGC AG GTCAGGG AGA TCACGTTCCT G AAAAACACG GTG ATGG AGT GTG ACGCXjTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTQCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCOACGTCA ACOAGTGCAA 420 
CQCCCACCCC TGCTTCCCCC GAGTCCGCTQ TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG CXXjGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGCnTGCA CXjGACATCAA CG AGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCXjTGGGC GACCAGGCGT COGGCTGCCA GCGOGGCGCA CAGCGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA G AGCGCGATG GCTCGCGGTC 780 
GTGCGTGTOT CGCGTTGGCT GGGCCGGCAA CGGG ATCCTC TGTGGTCOCG ACACTGACCT 840 
AGACGGCTTC CCGGACGAGA AGCTGOGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC OGCTGGTGCG 1020 
GAACCCAGAC CAGCGCAACA CXjGACXiAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCO 1080 
GTCCCAG AAG AACG ACGACC AAAAGGACAC AGACCAGGAC GGCCX3GGGCG ATGCGTGCGA 1 140 
CGACOAC ATC G ACGGCX3 ACC GGATCOGCAA CCAGGCCG AC AACTGCCCTA GGGTACCCAA 1 200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 
G AAOAGCAAC CCGGATCAGG CGGATGTGG a CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 
CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAOACCACGA TGGCCAGOGT GATGCCTGCQ ACGACGACGA 1440 
CGACAATGAC GGAGTCCCTO ACAGTCGGG A CAACTGCCGC CTGGTG CCTA ACCCCGGCCA 1500 
GG AGGACGCG GACAGGGACG GOGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAG ATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 
GGCCTTCCAG ACAGTCGTGC TGGACCOGGA GGGTGACGCG CAGATTGACCCCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA QACAATGAAC AGCGACCCAO GCCTGQCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTTCGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
GGATGACGAC TATGCOGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860
CATGTGGAAG CAGATGG AGC AAAOGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAG AGTC CC AGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
AAACOTOOGT TGGAAGOACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCGAT TCTATGAGGG CXCTGAGCTG GIGGCXTGACA GCAACXiTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACOjCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTO AGGACCCGCG GGATGACAGC CACCCTCACC 2340 
OCOGCTGO AT GGGGOCTCTO CACCCAGCCC AAGOGGTGGC CGTCCTGAGO CGOAAGTGAO 2400 
AAGGGCTCAG AG AGGACAAA ATAAAGTOTG TGTGCAGGG 

Seq ID NO: 118 Protein sequence:
Protein Accession #: NP_000086.1

1 11 21 31 41 51
| | | | | |

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQG VGLAF AKANKQVCTD 1 80 
INECBTGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 
ADCVLERDGS RSCVCRVGWA GNQILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEFCDNCPL VR>JPDQRNTD EDKWQDACDN CRSQKNDDQK 360 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
RDNCRLVPNP GQEDADRDG V GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEOTFHVNT VTDDDYAQFI 600 
FG YQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVG WKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 
RLG VFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: 119 DNA sequence

Nucleic Acid Accession #: NM_014211

Coding sequence: 157-1479 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

GGGACAGGGC TG AGGATG AG G AG AACCCTG GGG ACCCAGA AG ACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAG AGCC TCAACAACTA CCTGGTG ATT 1 20 
CCTACTTCAO CCCCTTGGTG TG AGCAGCTT CTCAACATG A ACTAC AGCCT CC ACTTGGCC 1 8 0 
TTCGTGTGTC TG AGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
G AGGTCGGCA GAAGTGACAA GCnTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCCAA TTTTGGTQGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAOCAT TTCAOAG AGT AACATQG ACT ACACAGCCAC CATATACCTC 420 
CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA G ATACTTACA TTGTGGAGTC CAAG AAGTCC 540 
TTCCrCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
CTGTATGCCC TCAGA ATCAC GACAACTGIT GCATGTA ACA TGGATCTGTC TA AATACCCC 660 
ATGOACACAC AOACATGCAA QTTGCAGCTG GAAAGCTOGG GCTATG ATGG AAATGATGTG 720 
G AGTTCACCT GGCTG AGAGG GA ACOACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAGAGCGGTA TTTCACCTrA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840 
TACACTAGAT TGGTCTTACA GTITGAGCTT CGGAGGAATG TTCTGTATTr CATTTTGGAA 900 
ACCTACGTTC CTTCC ACTTT CCTGGTGGTG TTGTCCTGGG TTTC ATTTTG GATCTCTCrC 960 
G ATTCAOTCC CTGCAAGAAC CTGCATTGGA QTGACOACCO TQTTATCAAT GACCACACTG 1020 
ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCA AGGC CATCGATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCnT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTTGACTACAGTG ACTTGACAATGAAAACCAGC 1320 
GACAAGTTCA AGTTTGTCTT CCG AG AAAAG ATGGGCAGGA TTGTTG ATT A TTT CACAATT 1380 
CAAAACOCCA GTAATOTTQA TCACTATrCC AAACTACTGT TTCCTTTGAT TrTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTfiAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGTCTTCAACAGG ACAAGATAAT G ATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TG AGTCTTTC TGCTCAAAGA 1 620 
ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740 
TTACAAATGT ACTCAOGGCT GTITATrOGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA OAAGTTGTTT CTAAAGTAAC 1860 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 
G AGTAACATT TTCTAGTrTT TGTTTCTGGT TAAAATG AAA TATGGGCTTA TGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA OATOAGTnT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTG AAAA TTCAAATATA 2100 
AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 
GGATTCCCCA TACTGG AAGG ACTCTGAGGC TTTATTCCCC CACTATGCAT ATCTTATCAT 2220 
TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTITTCCC ATGCATGGAT 2280 
GGAAATGGAA G ATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 
AGGCTTGCAG AATTGAGTCC ATTTTCTAGC TGCCTTTATT CACATAOTGA TGGGGTACTA 2400 
AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460
GAGCAACACT CTCCCAGTGG CAGATCCCCTGTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAOGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT A AGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 
TGAGTOGCTT ATCCCGCATG AGCAGGAGCG TGCTGGOCCT GAGTACTG AA CnTCTGAGT 2700 
AACAATGAG A CACGTTACAG AACCTATGIT CAGGTTGCGG GTGAOCTGCC CTCTCCAAAT 2760 
CCAGCCAG AG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GIGATITTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCX^ATCT 2880 
TTAATTTCrr AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAA ATA A 2940 
ACCAAGGCTC TAAAAGATOA TTrCCXTTCT GTAACTCCCT AG AGCCACAG GTTCTCATTC 3000 
CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTG ATCACCT GATTnTTTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAG ATGA AATGTGGTTG 3120 • 
TATGAGCCAA TCATATTTGT GAll lTHAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACA ATG ATA AAGATGTG AA CIXSTGAAATA 3240 
AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 120 Protein sequence:
Protein Accession #: NP_055026.1

1 11 21 31 41 51
| | | | | |

MNYSLHLAFV CLSLFTERMC IQQSQFNVEV GRSDKLSLPO FENLTAGYNK FLRPNFQQEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
YIVESKKSFL HEVTVGNRU RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 180 
WGYEXjNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIG VT TVLSMTTLMI GSRTSLPNTN 300 
CFDCAIDVYL QICFSFVFOA LLEYAVAHYS SLC3QMAAKDR QTTKEVEEVS ITNIINSSIS 360 
SFKRKISFAS lEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFnQN PSNVDHYSKL 420 
LFPLIFMLAN vfywayymyf 

Seq ID NO: 121 DNA sequence 

Nucleic Acid Accession #: NM_001854

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

AACCATCAAA TTTAG AAG AA AAAGCCCTTT G ACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT G AGTTCACAG TTGTGAGTGC GGGGCTCGG A GATGG AGCCG TGGTCCTCTA 1 80 
GGTGGAAAAC G AAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCTAGTTGA TGTACTAAA A GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGOGATT TTGCACAAAC AGAAAG AATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGO AATTCAGTCT TTCCllllAT CTATATATAA TG AGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTG A AG ACCACACT GGAAAACCTG 600 
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG COTGGAG AAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CGA AACCACT TGATAOAAOT GAGAQAGCAA TTGTTGATAC CAATGGAATC AOGGTmTG 780 
GAACAAGG AT TTTGG ATG AA GAAGTTTTTG AGGGGG ACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
C ACCCAAG GC TGCTCAAGCT CAGG AACCTC AG ATAG ATG A GTATGCACC A GAGG ATATAA 960 
TCGAATATG A CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
OACCCACTGT AACTOAGGAG ACAATAGCAC AOACGGAGGC AAACATCGTT GATGATTTTC 1080 
AAGA ATACAA CTATGG AACA ATGGAAAGTT ACCAG ACAG A AGCTCCTAGG CATGTTTCTG 1 140 
GGACAAATG A GCCAAATCCA GTTGAAGAAA TATTTACTGA AG AATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 • 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 
CAGAAACf GA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGG A CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTG AT GGTCTACCTG 1 620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 
CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 
TOAGAGGCCC ACCTGGCOCA ATOGQTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGG AGATCG AGGGTTTGAT GGACTTCCGG 1980 
GTCTGCCAGG TG ACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCrC 2040 
CTOGTQATGA TGGAATOAGG GGAGAAGATG GAGAAATFGG ACCAAGAGGT CTTCCAGGTQ 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGOAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 
CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGG AAAACC AGG ACTTGCT GGACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA OTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGO COCCCOGGOA GTAAAGGOAG 2460 

CAGATGGTOT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAG AT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGG A GACCXAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TrGGAGTTCC AGG ATTACCA GG ATATCCAG 2700 
QAAGACAAGG TCCAAAGGGT TCCACTOOAT TOCCTOGOTTTCCAOOTOCC AATGOAGAGA 2760
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGQG TCAGCGTGGT CCAAOGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 

TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000

ACCCraOGCA ACOTGGGG AG ACTGOATTTC AAGGCAAGAC CGGCCCTCCT OGGCCAGGGG 3060
OAGTGGTTGG ACCACAGGGA CCAACOGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCrCC TGGCCCTCCT GGTGAGCAAG GTCITCCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 

A AGGTCXrCC A GGGCCCACCA GGTCCAGTTG GCTCACCAGG AG AACGTGGG TCAGCAGGTA 3360
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC OGGGACCTCA GGGTCCTCCT GGTCCAOCTO 3420 
GAGAGAAAGG TGCTCCTGGA G AAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGG AG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGG AATT GCTGGAGGTG 3660
ATOGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
OCAGAGGCIT CCCTGQACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCOCACCTG 3780 
GTGAAAAAGG TOAAAATOGG G ATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
G AGGCCCTCA AGGTCCCAAT GG AGCTG ATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 

CAGTTGGTGG TGTTGG AG AA A AGGGTGAAC CTGGAG AAGC AGG AAACCCA GGGCCTCCTG 3960
GGGAAGCAGG TGTAGGCOGT CCCAAAGGAG AAAGAGGAGA GAAAGGGG AA GCTGGTCCAC 4020 
CTGGAGCTGC TOGACCTCCA GGTGCCAAGQ GGCCGCCAGG TGATQ ATGQC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC IGGGGAACTT GGCCCTGCAG 4140 
GTCAAGATGG TGTTG GTGGT GACAAGGGTG AAG ATGG AG A TCCTGGTCA A CCGGGTCCTC 4200 

CTGGCCCATC TGGTG AGGCT GGCCCACCAG GTCCTCCTGG AAAACG AGGT CCTCCTGGAG 4260

CTQCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAA ACCGGC CCAGTCGGTC CTCAGGG ACC TGCAGG AAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGG ACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTG ACCCTG 4500 

GCTCCAAGGG TG AA AAGGGA CATCCTGGTT TAATTGGCCT G ATTGGTCCT CCAGGAG AAC 4560
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAA AGGCT CTACTGG ACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TG AAGTCATT CAGCCTTTAC 4800 

CAATCTTGTC CTCCAAAAAA ACGAGAAG AC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CGAGCATATG AA ATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCr GCAACTCAGC CATCCTGACT TOCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TOCTTCAAAG TrTACTGTAA TTTCACATCT OGTGGTGAG A 5100 

CFTGCATTTA TCCAG ACAAA AAATCTGAOO G AOTAAG AAT TTCATCATGG CCAAAGG AG A 5 1 60
AACCAGG AAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC C ATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTTCACCTACCACTGTCATCAGTCAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 
GTTATGACAAAGCACTTCGCTTCCTGGGATCAAATGATGAOOAOATGTCCTATOACAATA 5400 

ATCCTTTTAT C AA A AC ACTG TATO ATGGTT GTACGTCCAG A A AAGGCTAT GA AAAAACTG 5460
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTG A TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCITGGCT 5580 
AAGATTAAGA CAAAG AACAT ATCAAATCAA CAQAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CA TATAC AGG 5700 

TACCATTTAG GAAATACCGA TGCCTITGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCTTGATTCCCAA 5820 
ncrCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 
AATATATATT CATAA AAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

CCAAAACTTG CACGTGTCCC TG A ATTCCGC TGACTCTAAT TTATGAGG AT GCCGAACTCT 6060
GATGGCAATA ATATATGTAT TATGAAAATG AAOTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCrrrGGTTAATGATGAA ATTCCTTTOT OTGTOTTT 

Seq ID NO: 122 Protein sequence:
Protein Accession #: NP_001845



1 11 21 31 41 51 

MEPWSSRWKT KRWL WFTVT TLALTFLFQA REVRGA APVD VLKALDFHNS PEGISKTTGF 60
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLTTGDPK AAYDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

NIVDDFQEYNYGTMESYQTEAPRHVSGTNEPNPVEEIFTEEYLTGEDYDSQRKNSEDTLY 360
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPG VPAEFD UETSINGHG 420 
AYGEKGQKGE PAVVEPGMLV BGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 
GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMG LTGR? 540
OPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
GLAGLPGADG PPGHPGKEGQ SGEKGAIjGPP GPQGPIGXPG PRGVKGADGV RQUCGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLG VP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPG ERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGDPGPQGISGKD 1020 
GPAGLRGFPG ERGLPGAQGAPGLKGGEGPQGPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 
GPPGPAGEKG APGEKGPQGP AGRDG VQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1 140 
GGKGENGPPG PPGLQGPVGA PGUGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPO KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLRGIPGPVGEQG LPGAAGQDGP PGPMG PPG LP 1440 
GUCGDPGSKG EKGHPGLIGL IGPPGEQGEKGDRGLPGTQG SPGAKGDGGIPGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSG LP G PPGPPG PPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGM EEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPEXj 1 620 
EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRl SSWPKEKPGS WFSEFKRGKL 1680 
LSYLDVEGNS INMVQMTFLK LLTASARQNfF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 

Seq ID NO: 123 DNA sequence

Nucleic Acid Accession #: NM_015886

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAG AC ATTATCAGCA 60 
CATTCTCTOT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTG AA TTTCAATTGC 120 
TGAAAATTTC CCACTG A AAA TATGCAGTAA TATATTTTOT GGTTCAG ACA TTTGGGGCAA 1 80 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGnT GGATTnTGA AGAAAGTCAA ATCAAGCAAT 360 
GCT(XCAAAT GATGTCTTTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
AAGTACCXiTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAGA TTCAGCGGAT ATCCCCAAAG CCAGGCGG AA 660 
GCGCTACATT TCGCAG AATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAA A ATCTTGCAAA 780 
ATCGGCAG AG GCTTGGGCGG CTACTTGCAT TTGGQACCAT OGACCTTCTT ACTTACTG AG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGG AAGATAT CGCTCTATTC TCC AGTTGGT 900 
CAAGCCATGG TATGATGAAG TG AAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATG AG ATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACrrCCAAT CGG AT AGGAT GCGCAATTCA TGCTTGOCA A AACATGAATG TTTGGGGATC 1 080 
TGTOTGGCGA CGTCCAGITT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 140 
AGAAGCACCA TATAAAGTAG GGGTACCATO TTCATCTTGT CCTCCAAG1T ATQGGGG ATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTOTACT GGTTTA AAJA 1 260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATAITTTGT GCTAATCTTG TnTCCTCTr 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATOTTT GTITAATCCT TTGGGAATTC 

Seq ID NO: 124 Protein sequence:
Protein Accession #: NP_056970.1

1 11 21 31 41 51
| | | | | |

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FFYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 
SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKONWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 



Seq ID NO: 125 DNA sequence

Nucleic Acid Accession #: NM_001793

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GCGGAACACC GGCCCGCCGT CGCGGC AGCT GCTTCACCCC TCTCTCTGCA GCCATflGGG C 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCrCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCITGG AG GCGGGAGGCG 1 80 
CGGAGCAGG A GCCCGGCCAQ GCGCTGGGGA AAGTATTCAT GOGCTOCCCT GOGCAAGAGC 240 
CAGCrCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300
AAAGAAGGTC ACTGAAGOAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
QAAGACACAA 0 AO AO ATTOO GTOOTTGCTC CAATATCTGT CCCTG AAAAT GGCAAGGOTC 420 
CCTTCCCCCA GAG ACTO AAT CAGCTCAAGT CTAATAAAGA TAGAGACACX: AAGATTTTCT 480 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAG AGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAG ATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATO AACATCTCCA 660 
TCATCX3TGAC CG ACX:AG AAT GACCACAAGC CCAAGTITAC CCAGGACACC TTCCGAGGGA 720 
OTOTCTTAOA GGGAGTCCTA CCAGGTACTT CTGTO ATOCA OGTOACAOCC ACAOATOAGO 780 
ATGATGCCAT CTACACCTAC AATGGGGTGO TTGCTTACTC CATCCATAGC CAAG AACCAA 840 
AGGACCCACA CG ACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GG ACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT G ATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1 140 
GTGCTACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TITTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTr GGATTTTG AG GCCAAAAACC 1260 
AGCACACCCT GTACGTTG AA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 
CCrcCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAO GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAOACCCAO 1500 
CAGGGTGGCT AGCCATGG AC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1 560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT G ATGTCAACG 1 680 
ACCATGGCCC AGTCCCTGAQ CCCCGTCAG A TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTG AA CATCACGG AC AAGG ACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1 800 
TCACAG ATGA CTCAG AC ATC TACTGGACGG CAG AGCTTCAA CG AGGA AGGT G ACACAGTGG 1 860 
TCITGTCCCT GAAG AAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG i 920 
ACCATGGCAA CAAAG AGCAG CTGACGGTG A TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 
ATQTOQAAAC CTGCCCTOOA CCCTOGAAAG GAGGTTTCATCCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCTGCTGTrcCTC CTGCTGOTGC TGCnTTGTT GGTGAG AAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCOGTGA CAACGTCTTC TACTATGGCG 21 60 
AAGAGGGGGG TGGCGAAG AG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAG AACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCG ACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTG A GCTCCCTCAC CTCCTCCGCC TCCG ACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTTCAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GG AGO ACQ AC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCrTCAGCT GAGG ACTTCG G AGCTTGTCA 2640 
GGAAGTGGOC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGOATGG AGGAATGTGG GCAGTITGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAG AA GCCTCTTACC TOCCGTAAAA 2820 
TGCrCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATrr CrGGTTTCCA GACCCCAATG CCTCOCATTC GGATGGATCT CTGCGTITTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TnTATnTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACnTTT TATTA AAGAA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP_001784

1 11 21 31 41 51
| | | | | |

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALF STDN DDFTVRNG ET VQERRSLKER NPLKIFPSKR ILRRHKRDW V V APISVPENG 1 20 
KGPFPQRLNQ LKSNKDRDTK ffYSITGPGA D SPPEGVFAV EKETG WLLLN KPLDREEI AK 1 80 
YELFGHAVSE NGASVEDPMN ISHVTDQND HKPKFTQDTF RGSVLEGVLP GTS VMQVTAT 240 
DEDDAIYTYN G VVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYUMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVOT LDREDEQFVR NNIYEVMVLA MDNOSPPTTG TOTLLLTLID 540 
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDrV WTAEVNEEOD 600 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGG EED QDYDITQLHR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFUE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: NM_003256.1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51
| | | | | |

CCTGCTGGOG CCGTCCAOTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAOTGTC^ 60 
TGCCrOGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120
TGCGGCCCCC GGGGCTGGGT G AQGCATGCA GCTGCXjCCCC GGCGCACCCT CAGCAGCACA 1 80 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAOGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAO ATAAAG ATGT 300 
TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCnTT GACTCTTCCC 360 
TCTGTGGTGT G AAACTAG AA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCX^CTGG GAGGACCTGT 480 
CCTTGGTGCA G AGGGAA ACT CTG AATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAG ACT 600 
GGCTGTTGGA ACGAAAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATG A AGCATG 660 
TTGACGGCAC CTGCAGCTGG TACCX3GGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TOGTTCAGCC CTAgTAGGGA CCAGTG ACCA TCACATCCCT TCAAGAGTCC TG AAG ATCAA 780 
GCCAGTTCTC CTTCCCTGCA G AGCTTTGGC CATTACX:aCC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCX:AAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGnrCTT CTGGCITAGT CTGITTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGGA G AAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP_003247.1 

1 11 21 31 41 51 
I I I I I I 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALVIRA KISSEKVVPA 60 
SADPADTEKM LRYEIKQIKM FKGFEKVKDV QYIYTPFDSS LCGVKLEANS QKQYLLTGQV 120 
LSDGKVFIHL CNYIEPWEDL SL VQRESLNH HYHLNCGCQI TTCYTVPCTI SAPKECLWTD 1 80 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1591 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCACGCGTCC GCAATGAAGC CGAGTG AATG GGGGCTGAAT GTGCGAGTCC ATAGCTG AAG 60 
AGGAGCGCCA G ATGGTGG AG G AATACACTT ATTTATGAAA CTGTCITGAG TTCTTCTTGA 1 20 
ATTGCCAGTT TTCAGCCTCC TCATGCCTCC GTCTCCTTTA GACG ACAGGG TAGTAGTGGC 1 80 
ACTATCTAGG CCCGTCCGAC CTCAGG ATCT CAACCTTTOT TTAGACTCTA GTTACCTTGG 240 
CrCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATQCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GG AACCTCTA CCACCTGCCC 480 
TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCr CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGG A CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TQACTTGGCA AAGAAGATQA CCAAATGCAG CAAGAGTCAC CTGCCG AGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCA AG GAGCTGTCCA 720 
CA1TAACTGT GCCGATAAGA TCAGCCGGCG GAGACTGCAG CAGGGCAAG A TCACTGTCCT 780 
AGACrTGATT TCCTGTAGGG AAGGCAAGGA CTCnTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATGAGAATA GCAATGAAGC AAGCCOAGTO ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC G AGTCCCTG A AGAGAG AAGG CAAAGAACCT CTGOTGTTGA AAGGTGGACT 960 
TAGTAGTrrr AAGCAG AACC ATG AAAACCT CTGTGACAAC TCCCTCCAOC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 
CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCITG CCCTTCCTGT TCCTTGGCAA 11 40 
TGAGCAGGAT GCTCAGG ACC TGG ACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1 200 
OGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACTGACAGCAACA AGCAGAACCTGCGGCAGTACTTTGAAGAGG CnTTGAGTT 1320 
CA1TGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 
CCGCTCOGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCA AAGG ACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTGIfiACAATGGTCT GGATGGAAAG GATrGCTGCT 1620 
CTCCATTAGG AGACAATGAG G AAGG AGGAT GGATTCTGGT i 1 11 i 1 TCTT T C 1 1 1 1 1 ' 1 ' 1 1' 1680 
TTGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTG GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 
ATTAAGGAGG TTAAAGAAGT AATTTTTnT AAGCCCAACC ATTAAAAATT TAATACAACT 1860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTITATGAG 

Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_009138.1 

1 11 21 31 41 51 
I I I I I I 

MPPSPLDDRV V VALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG nTTAIGTST TCP ANQMVNN 120 
NENTGSLSPS SGVOSPVSGT PKQLASIKU YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF 180 
MEYNKSHIQG AVHINCADKI SRRRLQQGKI TVLDLISCRE GKDSFKRIFS KEIIVYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GVSRSATIVI 420 
AYLMKHTRMT MTDAVKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRI LTPKLMGVET 480 
VV 

Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: NM_005409.3 
Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

TrCCnTTCATGTTCAGCATTTCTACTCCTTCCAAGAAGAGCAGCAAAGCrGAAGTAGCAG 60 
CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTQ TQAAQOOCAT QOCTATAOCX: 120 
1TGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 1 80 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAG AAGTGA TTATTACCCT GAAAGAAAAT 300 
AAAGGACAACGATGCCTAAATCa:AAATCG AAGCAAGCAAGGCTTATAATCAAAAAAGTT 360 
GAAAGAAAG A ATTTTTAA AA ATATCAAAAC ATXtOAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTG ACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGACTTTTCT ATGGTTTTGT G ACTTTCAAC TTTTGTACAG TTATGTGAAG G ATGAAAGGT 540 
GGGTGAAAGG ACCAA AAACA G AAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 
CrcCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 
GGTTACCATCGGAGTTTACAAAGTGCITTCACGTTCTrACTTGTTGTATTATACATrCAT 720 
GCATTTCTAO GCTAG AGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 
GAOAACATTT CTGTCTCTAG AAGTTATCTG TCrGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGrrTAC AGTGGAGACA TTG ACATTAT TACTGGAGTC AAGCTCTTAT AAGTCAAAAG 900 
CATCTATGTG TCGTAA AGC A TTCCTCAAAC ATTTnTCAT GCAAATACAC ACTTCTTTCC 960 
CCAAATATCA TGTAGC ACAT CA ATATGTAG GG AAACATTC TTATGCATCA TTTGGTTTGT 1020 
TITATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCT A 1080 
TGGGATACTO GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GT CTTAA TTT 1140 
GATGTTTTtC AACITI l ATT CATTGAGATG TTITGAAGCA ATTAGG ATAT GTGTGTTTAC 1200 
TGTACTTrTTGTTTrGATCCGTTTGTATAAATGATAGCAATATCTTGGACACATTTGAAA 1260 
TACAAAATGT TnTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1 380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTAT ACTCAA OACTAGTTTA AAGA ATCAAA 1440 
OTCATTTTTT TCTCTAATA A ACTACCACAA CCTTTCnTT TTA AAAAA AA AAA 

Seq ID NO: 132 Protein sequence: 

Protein Accession #: NP_005400.1 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIALAVILCATWQGFPMFKRGRCLCIGPGVKAVKVADIEKASIMYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIKKVE RKNF 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_012342 

Coding sequence: 373-1155 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CTGGCGCGGG CGGG AGCTGC GGCGGATACC CTTGCGTGCr GTGGAG ACCC TACTCTCTTC 60 

GCTGAGAACGGCCGCTAOCGGGGACTGAAGGCCGGGAGCCCACrCCCGACCCGGGGCTAG 120 
CGTGOGTCCC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGG A 1 80 
AACmTCrG GGCTCCTGGA G AGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACOCAGCCC CGCCGCTG AC GG AGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
OOGGGGGTCGTAGCTGCCGCCGAGCOGGGGCTCCGGAAGCOGGCGOGGOCGCCGCGGCCG 360 

TGCGGGGCGT CAATCGATCG CC ACTCCAGC TACATCTTCA TCTGGCTGCA GCTGOAOCTC 420 
TGCGCCATGG CCGTGCTGCT C ACCAAAGGT G AAATTCG AT GCTACTGTG A TGCTGCCC AC 480 
TGTGTAGCCA CTGGTTATAT GTGTA AATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAG A ACTCAAATTC CCCACTCACC CATGGCTGCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCTGCCAAGCC>VAACAGGCCCGAAACCA<nxnX3GCACCACCATACCCACATTG 660 

GAATGCTGTCATGAAGACATGTGCAATrACAGAGGGCTGCACGATGTrCTCTCTCCTCCC 720 
AGGGGTGAGG CCTCAGGACA AGGAAACAGO TATCAGCATG ATGGTAGCAG AAACCTTATC 780 
ACCAAGGTCC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CGAAGTGAAA ATAAG AGGCT GCAGGATCAG CGGCAACAG A TGCTCTCCCG TTTGCACTAC 960 

AGCnTCACGGACACCATTCCAAAAAGGGGCAGGTTGCAAAGTTAGACTTGGAATGCATG 1020 
GTCCOGGTCA GTGGGCACGA GAACTGCrGT CTGACCTGTG ATAAAATGAG ACAAGCAG AC 1080 
CTCAGCAACO ATAAGATCCT CTCGCTTGTT CACTGGGOCA TGTACAGTGG GCACGGGAAG 1 140 
CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGA AGGC CTT 1200 
TTGAGTrCTG CTGGACAGGA GCACTTTATC TGAAGACAAA CTCATTTAAT CATCTTTGAG 1260 

AGACAAAATG ACCTCTGCAA ACAGAATCTT GGATATTTCT TCTGAAGGAT TATTTGCACA 1320 
GACriTAAATA CAGTTAAATG TGTTATTTGC TnTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTCTACACA CTOTCACCAG QGTTATTTOC ATCCAAOOGA GCTGG AATTG AGTACCTAA A 1440 
TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA OATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
Protein Accession #: NP_036474.1 

1 11 21 31 41 51 

I I I I I I 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT QYMCKSELSA CFSRLLDPQN 60 
SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLEOCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQHD GSRNLITKVQ ELTSSKELWF RAAVIAVPIA GGULVLLIM LALRMLRSEN 180 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_001627. 1 

Coding sequence: 64-1815 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

OGGGACGACO C(XCCTCCTO CGGCOTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT OCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCrCTT G ATCTCCGCC 120 
ACOGTCTTCA GGCCAGGCCT TGG ATGGTAT ACTGTAAATT CAGCATATGG AG ATACCATT 1 80 
ATCATACCrr GCCG ACTTG A CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 
AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAG AATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TG ATGAAAAG AG AnTGTGT GCATOCTAGT AACTGAGGAC 420 
AACOTGTTTG AOGCACCTAC AATAGTCAAG GTOTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAG AG CAGCTAAAAA AGTTGGGTG A CTGCATTTCA 540 
GAAGACAGTT ATCCAG ATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CITGAAGGAG CGGTGGTCAT AAITTITAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAOACA ACCAAGGCTO ACATACAAAT GCCATTCACC 720 
TGCTOGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AG AGC AGGTG ACAATACAAG TGCTGCXTACC AA AAAATGCC 840 
ATCAAAGAAG GGG ATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATmTGT TTTACTTACC AGG ACAGCCC GAAGG AATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCA ACAGGA GACTACAAGT GTTCCCTG AT AGACAAAAAA 1 020 
AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAG AT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1 140 
AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAO CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATQACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGG ACAATT ACTGGCAGTG GAAGCGTCAT AA ACCAAACA 1 440 
GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATITC CCCTGAAGAG 1 500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAG A 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCXXTTGTTG CTGGTGTCGT CTACTOGCTG TACATOAAGA AGTCAAAGAC TGCATCAAAA 1740 
CATOTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATCAC 1800 
AAAACTGAAG CCXAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC 1860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTrCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGmTCAG AG AATTATCT 1 980 
CAAGTAAAACAAATGAAATTTAATTACAAA CAATAAGAAC AAGinTGGC AGCCATGATA 2040 
ATAGGTCATA TGTTGTGTTT GGTTCA ATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTA AGCTT 21 60 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC A ATTGTTTAT 2220 
ATGGATAATC TG AGCAGGTA CATTTCTGAT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCrCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 
CCCTAATTAA AA ACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCGG ACTC TTTGATATGC CACCAGCGAA CTCT CAGA AA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

Seq ID NO: 136 Protein sequence: 
Protein Accession #: NP_001618.1 

1 11 21 31 41 51 
I I I I I I 

MESKGASSCR LLFCLLISAT VFRPGLG WYT VNSAYGDTII TPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYKDRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VFEAPTFS^V FKQPSKPEIV SKALFLETEQ LKKLGDOSE DSYPDGNITW YRNGKVLHPL 180 
EGAWIIFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAl KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 
MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATWWMKDN IRLRSSPSFS SLHYQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQDCM 420 
TXKTDPSGLS KTIICHVEGF PKPAIQWTTT GSGSVINQTE ESPYINGRYY SKIIISPEEN 480 
mTCTAENQLERTVNSLNVSAISIPEHDEADHSDENREKVNDQAKUVGIVVGlil^ 540 
LVAGVVYWLY MKKSKTASiCH VNKDLGNMEE NKKLEENNHK TBA 



Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM_030559 

Coding sequence: 1-1119 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGAACOGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGG AAAATTT 120 
GAGGAGTTrr ATGGATTACT ACA ACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 1 80 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 
GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
TACAGTGCCr TTGGTACAGA CACGCTAATA AAGAAG AAGA ATGITITAAC CA ACXjTATTG 360 
CGTICCTGACA ACCATAG A AA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAG A 420 
CCTGTGTCTT CTATTATAOA CX3TGGATATT CTCCCAOAAA COCATCGTAG GGTACGTCTT 480 
TACAAATACXj GCACGGAGAA ACCXrCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 
GTAACACCAC ATG GCTTAG A AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGG A 600 
GGTCrGGCrC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTnTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAACAGACA TOATGATTGC AAATAGCCGT 720 
AACCrCATCA TAACAGTGAG ACCGGCAAAC CAGAGGA ATA ATGTTOTGAG GAACAOTCGG 780 
ACTTCTGGCA GTTCXGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 
GAACCAAGCT TTGAGCCAG A GG ATGAAGAC AGCG AAGAAG ATGACATTAT CATTGAAGAC 900 
AATGGAGTGC CACAGCAGAT TCCAA AAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATQAAGTG 1020 
AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTAISA 

Seq ID NO: 138 Protein sequence: 
Protein Accession #: XP_030559 

1 11 21 31 41 51 
I I I I I I 

MNRSHRHGAG SGCLGTM EVK SKFG AEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVD VLV 60 
GYADIHGDLL PINNDDNYHK AVSTANPLLR IFIQKKEEAD YSAFGTDTU KKKNVLTNVL 120 
RPDNHRKKPH IVISMPQDFR PVSSIIDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 1 80 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVO-EVNG lEVSGKSLDQ VTDMMIANSR 240 
NUITVRPANQRNNWRNSRTSGSSGQSTDNSLLGYPQQIEPSFEPEDEDSEEDDniED 300 
NOVPQQIPICA VPNTESLESL TQIELSFESG (^GFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTIITL 






It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
WHAT IS CLAIMED IS: 



1. A method of detecting a breast cancer-associated transcript in a cell 
from a patient, the method comprising contacting a biological sample from the patient with a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25. 

2. The method of claim 1, wherein the biological sample comprises 
isolated nucleic acids. 

3. The method of claim 2, wherein the nucleic acids are mRNA. 

4. The method of claim 2, further comprising the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

5. The method of claim 1, wherein the polynucleotide comprises a 
sequence as shown in Tables 1-25. 

6. The method of claim 1, wherein the polynucleotide is immobilized on 
a solid surface. 

7. The method of claim 1, wherein the patient is undergoing a therapeutic 
regimen to treat breast cancer. 

8. The method of claim 1, wherein the patient is suspected of having 
breast cancer. 

9. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1-25. 

10. The nucleic acid molecule of claim 9, which is labeled. 

11. An expression vector comprising the nucleic acid of claim 9. 

12. A host cell comprising the expression vector of claim 11. 

13. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1-25. 

14. An antibody that specifically binds a polypeptide of claim 13. 

15. The antibody of claim 14, further conjugated to an effector component. 

16. The antibody of claim 15, wherein the effector component is a 
fluorescent label. 

17. The antibody of claim 15, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

18. The antibody of claim 15, which is an antibody fragment. 

19. The antibody of claim 15, which is a humanized antibody. 

20. A method of detecting a breast cancer cell in a biological sample from 
a patient, the method comprising contacting the biological sample with an antibody of claim 
14. 

21. The method of claim 20, wherein the antibody is further conjugated to 
an effector component. 

22. The method of claim 21, wherein the effector component is a 
fluorescent label. 

23. A method for identifying a compound that modulates a breast cancer-associated 
polypeptide, the method comprising the steps of: 
(i) contacting the compound with a breast cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and 
(ii) determining the functional effect of the compound upon the polypeptide. 

24. A drug screening assay comprising the steps of: 
(i) administering a test compound to a mammal having breast cancer or a cell 
isolated therefrom; 
(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a control 
cell or mammal, wherein a test compound that modulates the level of expression of the 
polynucleotide is a candidate for the treatment of breast cancer. 
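
For illustration only, the "at least 80% identical" threshold recited in claims 1, 23 and 24 may be sketched computationally. The following is a minimal sketch, assuming two sequences that have already been aligned to equal length, with "-" marking gaps. The function names `percent_identity` and `meets_threshold`, and the convention of counting identical positions over all aligned columns, are assumptions of this example and are not taken from the specification, which may define identity by a particular alignment algorithm and parameters.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumption of this sketch: both inputs are pre-aligned, equal-length
    strings in which '-' denotes a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns in which both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Under this convention, a ten-base alignment with a single mismatch is 90% identical and would satisfy the 80% threshold, while a four-base alignment with two mismatches (50%) would not.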